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Abstract 

In the analysis of logic programs, abstract domains for detecting sharing and linearity 
information are widely used. Devising abstract unification algorithms for such domains 
has proved to be rather hard. At the moment, the available algorithms are correct but not 
optimal, i.e., they cannot fully exploit the information conveyed by the abstract domains. 
In this paper, we define a new (infinite) domain ShLin" which can be thought of as a 
general framework from which other domains can be easily derived by abstraction. ShLin" 
makes the interaction between sharing and linearity explicit. We provide a constructive 
characterization of the optimal abstract unification operator on ShLin" and we lift it to 
two well-known abstractions of ShLin". Namely, to the classical Sharing x Lin abstract 
domain and to the more precise ShLin^ abstract domain by Andy King. In the case of 
single binding substitutions, we obtain optimal abstract unification algorithms for such 
domains. 

KEYWORDS: Static analysis, abstract interpretation, sharing, linearity, unification. 



1 Introduction 

In the analysis of logic programs, the theory of abstract interpretation ( Cousot 
and Cousot 1979: ICousot and Cousot 1992aP has been widely used to design new 
analyses and to improve existing ones. Given a concrete semantics working over a 
concrete domain, an abstract interpretation formalizes an analysis by providing an 
abstract domain and an abstract semantics (working on the abstract domain) , and 
relating them to their concrete counterparts. An abstract domain is a collection of 
abstract objects which encode the property to analyze. The concrete and abstract 
domains are related by means of abstraction and concretization maps, which allow 
each concrete object to be abstracted into an abstract object which describes it. 
The abstract semantics, in most cases, is given by a set of abstract operators on 
the abstract domain, which are the counterparts of the concrete ones. For exam- 
ple, in the case of logic programs, one can individuate in the concrete semantics 
the main operations (unification, projection, union), and an abstract semantics can 
be specified by giving the abstract unification, abstract projection and abstract 
union operations. The theory of abstract interpretation assures us that, for any 
concrete operator, there exists a best abstract operator, called the optimal opera- 
tor. It computes the most precise result among all possible correct operators, on a 
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given abstract domain. Designing the optimal abstract counterpart of each concrete 
operator is often a very difficult task. In fact, even if the definition of the optimal 
operator for any abstract domain is known from the theory of abstract interpre- 
tation (as a composition of the concrete operator and the abstraction map), the 
hard task is to provide an explicit definition of the abstract operators and to devise 
algorithms on the abstract domain which compute them. 



1.0.1 The context 

The property of sharing has been the subject of many papers ^Jacob s and Langen 
1992 : IHans and Winkler 19921 'Muthukumar and Hcrmcncgil do 1992| Codish et al. 
1999; [Bagnara et al. 2002D , from the both theoretical and practical point of view. 
Typical applications of sharing analysis are in the fields of optimization of unifica- 
tion ( |S0ndergaard 1986]) and parallelization of logic programs (^ Hermenegildo and 
Rossi 1995). The goal of (set) sharing analysis is to detect sets of variables which 
share a common variable in the answer substitutions. For instance, consider the 
substitution {x/ f{u,v),y/g{u,u,u), z/v}. We say that x and y share the variable 
u, while X and z share the variable v, and no single variable is shared by a;, y and 
z. Many domains concerning sharing properties also consider linearity in order to 
improve the precision of the analysis. We say that a term is linear if it does not con- 
tain multiple occurrences of the same variable. For instance, the term f{x, f{y, z)) 
is linear, while /(x, f(y, x)) is not, since x occurs twice. 



1.0.2 The problem 

It is now widely recognized that the original domain proposed for sharing analy- 
sis, namely. Sharing ( jLangen 1990[ Jacobs and Langen 1992 ), is not very precise, 



so that it is often combined with other domains for handling freeness, linearity, 



groundness or structural information (see Bagnara et al. 2005 for a comparative 
evaluation). In particular, adding some kind of linearity information seems to be 
very profitable, both for the gain in precision and speed which can be obtained, and 
for the fact that it can be easily and elegantly embedded inside the sharing groups 
(see [King 1994[ ). In the literature, many authors have proposed abstract unifica- 
tion operators (e.g. ICodish et al. 19911 IHans and Winkler 19921 Muthukumar and 



Hermenegildo 1992; [King 1994 ) for domains of sharing properties, encoding dif- 
ferent amounts of linearity information. However, optimal operators for combined 
analysis of sharing and linearity have never been devised, neither for the domain 
ShLin^ ( |King^ 19W1). nor for the more broadlv adopted Sharing x Lin f Hans and 
Winkler 1992; [Muthukumar and Hermenegildo 1992D . 

With the lack of optimal operators, the analysis loses precision and might even 
be slower. The latter is typical of sharing analysis, where abstract domains are 
usually defined in such a way that, the less information we have, the more complex 
the abstract objects are. This is not the case for other kinds of analyses, such as 
groundness analysis, where the complexity of abstract objects may grow accordingly 
to the amount of groundness information they encode. 
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The lack of optimal operators is due to the fact that the role played by linearity 
in the unification process has never been fully clarified. The traditional domains 
which combine sharing and linearity information are too abstract to capture in a 
clean way the effect of repeated occurrences of a variable in a term and most of the 
effects of (non-)linearity are obscured by the abstraction process. 

1.0.3 The solution 

We propose an abstract domain ShLin'^ which is able to encode the amount of non- 
linearity, i.e., which keeps track of the exact number of occurrences of the same vari- 
able in a term. Consider again the substitution 9 = {x/ f{u,v),y/g{u,u,u),z/v}. 
Intuitively, to each variable w in the range of the substitution, we associate the 
multiset of domain variables which are bound to a term where w occurs, and call 
it an w-sharing group. For instance, we associate, to the variable u, the w-sharing 
group {x, y, y, y}, to denote that u appears once in 6{x) and three times in 9{y). To 
the variable v, we associate the w-sharing group {x,z}, to denote that v appears 
once in 9{x) and once in 9{z). Then we consider the collection of all the multisets 
so obtained {{x, y, y}, {x, z}}, which describes both the sharing property and 
the exact amount of non-linearity in the given substitution. The domain we obtain 
is conceptually simple, but cannot be directly used for static analysis, without a 
widening operator (jCousot and Cousot 1992c)) . since it contains infinite ascending 
chains. However, in this domain the role played by (non-)linearity is manifest, and 
we can provide a constructive characterization of the optimal abstract unification 
operator. The cornerstone of the abstract unification is the concept of sharing graph 
which plays the same role as alternating paths ( |S0ndergaard 1986} [King 2000| for 
pair-sharing analysis. We use sharing graphs to combine different w-sharing groups 
during unification. The use of sharing graphs offers a new perspective for looking 
at variables in the process of unification, and simplifies the proofs of correctness 
and optimality of the abstract operators. 

We prove that sharing graphs yield an optimal abstract unification operator for 
single binding substitutions. We also provide a purely algebraic characterization 
of the unification process, which should help in implementing the domain through 
widening operators and in devising abstract operators for further abstractions of 
ShLin"^. 

1.0.4 The applications 

We consider two well-known domains for sharing properties, namely, the reduced 
product (jCousot and Cousot 1979]) Sharing x Lin and the more precise domain 
ShLin^ by Andy King, and show that they can be immediately obtained as abstrac- 
tions of ShLin". By exploiting the unification operator on ShLin'^, we provide the 
optimal abstract unification operators, in the case of single binding substitutions, 
for both domains. We show that we gain in precision w.r.t. any previous attempt 
to design an abstract unification operator on these domains. This is the first time 
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that abstract unification has been provided optimal for a domain including sharing 
and linearity information. 

Surprisingly, the optimal abstract operators are able to improve not only aliasing 
and linearity information, but also groundness. We show that, in certain cases, we 
improve over Pos (Armstrong et al. 1994). This is mainly due to the fact that our 
operators exploits the occur-check condition. As far as we know, there is no abstract 
unification operator in the literature, for a domain dealing with sharing, freeness 
and linearity, which is more precise than Def for groundness. 

Unification for multi-binding substitutions is usually computed by considering 
one binding at a time. For instance, the unification of a substitution 9 with {xi/ti, 
x^jti^ ■ ■ ■ jXn/tn} is performed by first computing the unification of 6* with {xi/^i}, 
and then unifying the result with {x2/t2, . . . , a;„/t„}. Actually, computing abstract 
unification one binding at a time is optimal in ShLin'^ (j Amato and Scozzari 2005|) . 
We show that this is not the case for ShLin^ and Sharing x Lin. This means that 
the classical schema of computing unification iteratively on the number of bindings 
cannot be used when looking for optimality with multi-binding substitutions, at 
least with these two domains. 



1.0.5 Structure of the article 

In Section [2] we recall some basic notions and the notations about substitutions, 
multisets and abstract interpretation. In Section [3] we briefly recall the domain of 
existential substitutions and its operators, which will be used throughout the article. 
In Section [5] we define the domain ShLin", together with the unification operator, 
we show the optimality result and give an alternative algebraic characterization of 
the unification operator. In Section [5] we exploit our results to devise the optimal 
unification operators for ShLin^ and Sharing x Lin, in the case of single binding 
substitutions. Section [S] gives some evidence that there are practical advantages in 
using the optimal unification operators for ShLin^ and Sharing x Lin. In Section 
[7] we compare our domains and operators with those known in the literature. We 
conclude with some open questions for future work. The proofs of the main results 
of the paper are in Appendix \^ and the proofs of the results in Section [5] are in 
Appendix |B] 

The paper is a substantial expansion of (jAmato and Scozzari 2003)) , which intro- 
duces preliminary results of optimality for domains involving sharing and linearity 
properties. 



2 Notation 

Given a set A, let p(A) be the powerset of A and pf{A) be the set of finite subsets 
of A. Given two posets {A,<a) and {B,<b), we denote hy A^B the poset of 
monotonic functions from A to B ordered pointwise. We use <a^b to denote the 
order relation over A—>B. When an order for A or _B is not specified, we assume 
the least informative order [x < y <J=> x = y). We also use A ttl i? to denote 
disjoint union and \A\ for the cardinality of the set A. 
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2.1 Terms and substitutions 



In the following, wc fix a first order signature and a denumerablc set of variables V. 
Given a term or other syntactic object o, we denote by vars{o) the set of variables 
occurring in o and by occ{v,o) the number of occurrences of v in o. When it does 
not cause ambiguities, we abuse the notation and prefer to use o itself in the place 
of vars{o). For example, if t is a term and x € V, then x €: t should be read as 
X € vars{t). 

We denote by e the empty substitution, by {xi/ti, . . . , x„/t„} a substitution 
9 with e{xi) = ti ^ Xi, hy dom(6') = {a; € V | 0{x) ^ x} and rng(6') = 
V^xeAom(6)'vars(Q{x)) the domain and range of Q respectively. Let vars{d) be the 
set dom(6') U rng(6') and, given U G P/(V), let 9\u be the projection of 6 over U, 
i.e., the unique substitution such that 0\u{x) = 0{x) if x G U and 0\u{x) = x 
otherwise. Given 9i and 02 two substitutions with disjoint domains, we denote by 
01 W 02 the substitution such that dom{0) = dom(0i) U dom(02) and 0{x) = 0i{x) 
if X G dom{0i), for each i G {1,2}. The application of a substitution ^ to a term t is 
written as t0 or 0{t). Given two substitutions and 5, their composition, denoted 
by ^ o ^, is given by {0 o S){x) = 0{5{x)). A substitution is idempotcnt when 
9 o = 01, equivalently, when dom(6') fl rng(0) = 0. A substitution p is called 
renaming if it is a bijection from V to V (this is equivalent to saying that there 
exists a substitution such that p o ~ p~^ o p = e). Instantiation induces 
a preorder on substitutions: 9 is more general than 5, denoted hy 5 < 0, if there 
exists a such that u o ^ = (5. If w is the equivalence relation induced by <, we 
say that cr and 9 are equal up to renaming when a Ki 0. The sets of substitutions, 
idempotent substitutions and rcnamings arc denoted by Suhst, ISubst and Ren re- 
spectively. Given a set of equations E, we write a = mgu{E) to denote that cr is a 
most general unifier of E. Any idempotent substitution cr is a most general unifier 
of the corresponding set of equations Eq(cr) = {x = a{x) \ x G dom(iT)}. In the fol- 
lowing, we will abuse the notation and denote by mgu(c7i, . . . , cr„) the substitution 
mgu(Eq((7i) U . . . UEq(cr„)), when it exists. In spite of a single binding substitution 
{x/i) we often use just the binding x/t. In the rest of the paper we assume that a 
binding x/t is idempotent, namely,that x ^ vars{t). 

A position is a sequence of positive natural numbers. We denote with E the set 
of all positions and with N+ the set of all positive natural numbers. Given a term 
t and a position ^, we define t{^) inductively as follows: 



For any variable x, an occurrence of x in f is a position ^ such that t{^) = x. 

In the rest of the paper, we use: U, V, W to denote finite sets of variables; 
h,k,u,v,w,x,y, z for variables; t for terms; f,r,s for term symbols; a,b for con- 
stants; 7], 9, cr, i5 for substitutions; p for renamings. 



t{e) 



t 



(where e denotes the empty sequence) 
ti{C) at = f{ti,...,tn) a.nd i<n; 




undefined otherwise. 
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2.2 Multisets 

A multiset is a set where repetitions are allowed. We denote by {[xi, . . . ^Xm^ a 
multiset, where xi, . . . , Xm is a sequence with (possible) repetitions. We denote by 
the empty multiset. We will often use the polynomial notation . . . w^", where 
vi, . . . ,Vn is a sequence without repetitions, to denote a multiset A whose element 
Vj appears ij times. The set {vj | ij > 0} is called the support of A and is denoted 
by |[AJj. We also use the functional notation A : {vi, ...,«„} ^ N, where A{vj) = ij. 

In this paper, we only consider multisets whose support is finite. We denote with 
prn{X) the set of all the multisets whose support is any finite .subset of X. For 
example, both a^c^ and a^b^c^ are elements of pm({a, 6, c}). The cardinality of a 
multiset is \A\ = Et,e|[AJl ^('")- 

The new fundamental operation for multisets is the .sum, defined as 

A W S = Au G PJJ U \lBlA{v) + B{v) . 

Multiset sum is associative, commutative and \s, the neutral element. Note that 
we also use l±) to denote disjoint union for standard sets. The context will allow us 
to identify the proper semantics of tt). 

Given a multiset A and X C [[^Jj, the restriction of A over X, denoted by A\x, 
is the only multiset B such that [[Sj] = X and B{v) = A{v) for each v £ X. Finally, 
if A G pm{X), E[x] is an integer expression and x G X, we define 

J2e[x]- E m^)-e[x] . 

For example, given a multiset A — ^5, 5, 6, 8, 8, 8}j- then J2xeA -^^ — 2 * 5'^ + 6'^ + 
3 * 82 = 278. 



2.3 Abstract interpretation 

Given two sets C and A of concrete and abstract objects respectively, an abstract 
interpretation (jCousot and Cousot 1992bp is given by an approximation relation 
o C A X C. When a o c holds, this means that a is a correct abstraction of c. In 
particular, we are interested in the case when (^4, <a) is a poset and a <a a! means 
that a is more precise than a' . In this case we require that, if a o c and a <a a', 



then a'oc, too. In more detail, we require what Cousot and Cousot (1992b) call the 
existence of a best abstract approximation assumption, i.e., the existence of a map 
a : C ^ A such that for all a G ^, c G C, it holds that a o c <;=^ a{c) <a a. The 
map a is called the abstraction function and maps each c to its best approximation 
in A. 

Given a (possibly partial) function / : C ^ C, we say that / : ^ ^ is a correct 
abstraction of /, and write f <> f , whenever 

aoc^ /(a) o/(c) , 

assuming that /(a) o /(c) is true whenever /(c) is not defined. We say that / : 
j4 — > A is the optimal abstraction of / when it is the best correct approximation of 
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/, i.e., when / o / and 

V/' : A ^ A. /' o / ^ / <A^A f . 

In some cases, we prefer to deal with a stronger framework, in which the domain 
C is also endowed with a partial order <c and a : C ^ A is a. left adjoint to 
7 : A ^ C, i.e., 

Vc G C.Va G A.q;(c) <a a <^=> c fCp 7(a) • 

The pair (a, 7) is called a Galois connection. In particular, we will only consider 
the case of Galois insertions, which are Galois connections such that a o 7 is the 
identity map. If (a, 7) is a Galois insertion and / ; C — + C is a monotone map, the 
optimal abstraction / always exists and it is definable as / = a o / o 7. 



3 The domain of existential substitutions 

The choice of the concrete domain depends on the observable properties we want 
to analyze. Most of the semantics suited for the analysis of logic programs are 
based on computed answer substitutions, and most of the domains are expressed 
as abstractions of sets of substitutions. In general, we are not really interested 
in the substitutions, but in their quotient-set w.r.t. an appropriate equivalence 
relation. Let us consider a one-clause program p(x,x), the goal p{x,y), and the 
following answer substitutions: 9i = {y/x}, O2 — {x/y}, O3 = {x/u,y/u} and 
6*4 — {x/v,y/v}. Although 61 and O2 are equal up to renaming, the same does 
not hold for 6*3 and 9^. Nonetheless, they essentially represent the same answer, 
since u and v are just two different variables we chose when renaming apart the 
clause p(x,x) from the goal p{x,y), and therefore are not relevant to the user. 
On the other hand, if ^3 and 64 are answer substitutions for the goal q(x,y,u), 
then they correspond to computed answers q{u,u,u) and q{v,v,u) and therefore 
are fundamentally different. As a consequence, the equivalence relation we need 
to consider must be coarser then renaming, and must take into account the set 
of variables of interest, i.e., the set of variables which appear in the goal. For 
these reasons, we think that the best solution is to use a domain of equivalence 
classes of substitutions. Among the various domains proposed in the literature (e.g. 
Jacobs and Langen 1992[ [Marriott et al. 19941 [Levi and Spoto 2003[) , we adopt the 



domain of existential substitutions (jAmato and Scozzari 2009)) , since it is explicitly 
defined as a quotient of a set of substitutions, w.r.t. a suitable equivalence relation. 
Moreover, the domain is equipped with all the necessary operators for defining a 
denotational semantics, namely, projection, renaming and unification. We briefly 
recall the basic definitions of the domain and the unification operator. 

Given 61,62 G Subst and U G p/(V), the preorder :<ij is defined as follows: 

61 diu 62 ^ 3S e Subst.Vx G U. 6i{x) = d{d2ix)) . 

The notation 61 :<u ^2 states that di is an instance of 62 w.r.t. the variables in 
U . The equivalence relation induced by the preorder ^c/ is given by: 

6I1 6*2 <^ 3p G iZen.Vx G U. 6i{x) = p{62{x)) . 
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This relation precisely captures the extended notion of renaming which is needed 
to work with computed answer substitutions. 

Example 3.1 

It is easy to check that {x/w,y/u} ^{x,y} e by choosing the renaming p = {x/w, 
w/x,y/u,u/y}. Note that is coarser than the standard equivalence relation 
there is no renaming p such that e = p o {x/w,y/u}. As it happens for ^, if we 
enlarge the set of variables of interest, not all equivalences between substitutions 
are preserved: for instance, {x/w,y/u\ 7^{u,,x,j/} £• D 

Let ISuhst^jj be the quotient set of ISuhst w.r.t. The domain ISubst^ of 
existential substitutions is defined as the disjoint union of all the ISubst^^ for 
U € p/(V), namely: 

ISubst^ = [+) ISubstr.^ . 
uepfiv) 

In the following we write [6]u for the equivalence class of 6 w.r.t. ~[/. The partial 
order ^ over ISubst^ is given by: 

[d]u ^ [0']v ^ UDVaO^vO' ■ 

Intuitively, \ff\u -< \0'\v means that 9 is an instance of 0' w.r.t. the variables in V , 
provided that they are all variables of interest of 0. 

To ease notation, we often omit braces from the sets of variables of interest when 
they are given extensionally. So we write [6]x,y instead of and instead 

of ^{x^y.z}- When the set of variables of interest is clear from the context or when 
it is not relevant, it will be omitted. Finally, we omit the braces which enclose the 
bindings of a substitution when the latter occurs inside an equivalence class, i.e., 
we write [x/y]u instead of [{xly}\u- 

3.0.1 Unification 

Given U,V G p/(V), [9i]u, [92]v G ISubstr^, the most general unifier between these 
two classes is defined as the mgu of suitably chosen representatives, where variables 
not of interest are renamed apart. In formulas: 

mgn{[9,]u, mv) - [mgu{9[,9'^)]uuv , (1) 

where 9i ~£/ 9[ G ISubst, 92 ~y 9'^ G ISubst and {U U vars{9[)) n{VU vars{9'2)) C 
U (IV. The last condition is needed to avoid variable clashes between the chosen 
representatives 9[ and ^2- Moreover, mgu is the greatest lower bound of ISubst^ 
ordered by ^. 

Example 3.2 

Let 9i = {a;/a, y/r(f 1, f 1, 172)} and 92 = {y/r{a,V2,vi),z/b}. Then 
'a\g'yi{[0i]x,y,[^2]y,z) = [x / a,y / r{a, a, v) , z /h\x,y,z , 
by choosing 6[ = 9\ and 62 = {y/r{a, w, v), z/b}. In this case we have 
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{x/a,y/r{a,a,v),z/h} ^x,y,z 

TJign{9[,62) ^ {x/a,y/r{a,a,v),z/b,vi/a,w/a,V2/v} . □ 

A different version of unification is obtained when one of the two arguments is 
an existential substitution, and the other one is a standard substitution. In this 
case, the latter argument may be viewed as an existential substitution where all 
the variables are of interest: 

mgu([6'](7,'5) = nigu([6'][/, [(5]t,ar-s(5)) ■ (2) 

Note that deriving the general unification in ^ from the special case in ([2]) is not 
possible. This is because there are elements in ISuhst^ which cannot be obtained 
as [5]yars(S) for any 5 £ ISubst (see Example 14. lOp . 

This is the form of unification which is better suited for analysis of logic pro- 
grams, where existential substitutions are the denotations of programs while stan- 
dard substitutions are the result of unification between goals and heads of clauses. 
Therefore, the rest of the paper will be concerned with the problem of devising op- 
timal abstract operators corresponding to for three different abstract domains. 
Of course, unification is not the only operator needed to give semantics to logic 
programs: we also need projection, renaming and union. However, providing opti- 
mal abstract counterparts for these operators is generally a trivial task, and will 
not be considered here. 

We want to conclude the section with a small remark about our choice of the con- 
crete domain. By adopting existential substitutions and the corresponding notion of 
unification, we greatly simplify all the semantic definitions which are heavily based 
on renaming variables apart. This is because all the details concerning renamings 
are shifted towards the inner level of the semantic domain, where they are more 
easily managed ( Jacobs and Langen 1992 lAmato and Scozzari 2009|) . 



4 The abstract domain ShLin" 

The domain Sharing x Lin is one of the best known domains in the literature 
which combine sharing and linearity information. The domain Sharing records the 
information of variable aliasing, by abstracting the substitution 9 = {x/ f{u,v), 
y/g{u,u,u), z/v} into the set {uxy,vxz}. The object uxy, called a sharing group, 
states that d{u), 9{x) and 6{y) do share some variable (the variable u in this case). 
Analogously, the sharing group vxz states that 0{v), 9{x) and 9{z) do share (in this 
case the variable v) . One of the simplest way of adding linearity information is to 
record, in a separate object, the set of variables w such that 9{w) is a linear term. In 
our example, only 9{ii) is not linear. Thus the substitution is abstracted into the pair 
{{uxyTVxz},{u,v,x,z}). Another known domain in the literature is ASub whose 
main difference w.r.t. Sharing x Lin is that it only records sharing information 
between pairs of variables. Thus, in ASub, each sharing group has at most two 
elements. Developing optimal unification operators for such abstract domains is a 
difficult task. In our opinion, this is because the gap between the substitutions and 



10 



G. Amato and F. Scozzari 



Sharing X Lin (or ASub) is too wide and the combined effect of aliasing and linearity 
is difficult to grasp. 

We solve this problem by defining a new abstract domain ShLin" which can be 
used to approximate ISubstn^. Since ShLin" has infinite ascending chains, in most 
cases it cannot be directly used for the analysis. It should be thought of as a general 
framework from which other domains can be easily derived by abstraction. In this 
sense, ShLin'^ closes the gap between the concrete domain of substitutions and 
the abstractions like Sharing x Lin or ASub. The structure of ShLin"^ has made it 
possible to develop clean and optimal abstract unification operators. From these, 
optimal operators for the simpler domains are easy to obtain, at least for single 
binding substitutions. 

The idea underlying ShLin" is to count the exact number of occurrences of the 
same variable in a term. It extends the standard domain Sharing by recording, for 
each V £ V and 9 E ISubst, not only the set {w E V \ v E d{w)} but the multiset 
Xw € V.occ{v,9(w)). 

Definition 4-1 {to-Sharing Group) 

An uj-sharing group is a multiset of variables, i.e., an element of pm(V). 
Example ^.2 

Given variables u,v,w, x,y 6 V, examples of cj-sharing groups are u^v^x^^, xyz 
and u^^vwx^y'^ . □ 

Definition 4-3 

Given a substitution 9 and a variable w 6 V, we define 

9~^{v) = Xw. occ{v,9{w)) . 

Intuitively, 9~^{v) is an cj-sharing group which maps each variable w to the number 
of occurrences of v in 9{w). 

Example 4-4 

Given 9 = {x/ f{u,u,u),y/g{u,v), z/ f{u,v,v)}, we have that 9~^{u) = ux^yz, 
9-^v) = vyz^, 9-^{w) = w, and 9-^{x) = {{|. □ 

Definition 4-5 {Gorrect Approximation) 

Given a set of variables U and a set of w-sharing groups S (i.e., S C pm(C/)), we 
say that the pair (S, U) correctly approximates a substitution [9]w \iU — W and 
for each u e V, 9~^{v)\w & S. In the following we denote by [S]u the pair {S,U) 
and write [S]u '> [9]w to mean that [S]u correctly approximates [9]w- 

Therefore, [S]ij correctly approximates [9]ij when S contains at least all the to- 
sliaring groups which may arise in 9, restricted to the variables U. Note that [6]u 
is an equivalence class of substitutions, as defined in Section [3l while [S]u is just a 
symbol to denote the pair of objects (5*, U). We prefer this notation for the sake of 
uniformity with substitutions. 
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Theorem 4-6 

The relation o is well defined. 

We can now define the domain ShLin" of cj-sharing groups. 

Definition ^.7 (ShLin") 

The domain ShLin" is defined as 

ShLin- = {[S]u I U e Pf{V), S C p„([/), 5 ^ ^ O G 5} , 

and ordered by [5i];7j [*S'2];72 iff — U2 and Si C 5*2. 

The order relation corresponds to the approximation ordering, since bigger (w.r.t 
<(j) elements correctly approximate a larger number of substitutions than smaller 
elements. The existence of the empty multiset, when S is not empty, is required 
in order to obtain a Galois insertion, instead of a Galois connection. In order to 
simplify the notation, in the following we write an object [{-SSi^ii • • ■ i Bn}]u G 
ShLin'^ as [Bi, . . . , B„]ij by omitting the braces and the empty multiset. Moreover, 
if X e ShLin", we write B e X in place of X = [S]u A B e S . 

Definition 4-8 {Abstraction for ShLin") 

We define the abstraction for a substitution [9]u as 

aA[0]u)^[{0-\v)\u\veV}]u . 

This is the least element of ShLin'^ which correctly approximates [0]u- Note that 
by the proof of Theorem 14.61 it immediately follows that a^, is well defined, i.e., it 
does not depend from the choice of the representative for [d]u- 

Example 4-9 

Given 9 = {x/r{y,u,u),z/y,v/u} and U = {w,x,y, z}, we have 0~^{u) = x'^vu, 
6/-i(y) = xyz, e-\z) = e-^v) = 9-^ix) = {{} and 6l-i(s) = s for aU the other 
variables (included w). Projecting over U we obtain a;j([6'];7) = [x'^ , xy z , w]u ■ □ 

Example 4-10 

As we have said in Section[31 we show an element of ISubstn^, namely the existential 
substitution [x/r{v,v)]x, which cannot be obtained as [5]i,ar's(<5) for any substitu- 
tion S. In fact, consider any w-sharing group B ~ '5~"'^(u)|„ars((5) G '^uji[^]vars{S))- 
Then either u ^ rng(^) and B = or m £ i'ng((5) and B{u) = 1. However, 
a{[x/r{v,v)]x) = [x'^]x and x^ does not contain any variable with multiplicity one. 
□ 

4-1 Multigraphs 

In order to define an abstract unification operator over ShLin", we need to introduce 
the concept of multigraph. We call (directed) multigraph a graph where multiple 
distinguished edges are allowed between nodes. We use the definition of multigraph 
which is customary in category theory IjMac Lane 1988^ . 
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Definition 4^.11 (Multigraph) 

A multigraph G is a tuple {Nq, Eq, srcc, tgtg) where Nq ^ and Eq are the sets 
of nodes and edges respectively, srcg ; Eq — > Nq is the source function which maps 
each edge to its start node, and tgtg. : Eq —>■ Nq is the target function which maps 
each edge to its end node. 

A labeled multigraph G is a multigraph equipped with a labelling function Iq : 
Nq — > Lq which maps each node to its label in the given set Lq- 

We write e : ni ~^ n2 & G to denote the edge e S Eq such that srcG(e) = ui 
and tgtg.(e) — n2. We also write ui ^ n2 € G to denote any edge e e Eq such 
that srcG(e) — ni and tgtg.(e) ~ n2- Moreover, with \ni ^ n2 € G\ we denote 
the cardinality of the set {e £ Eq \ srcG(e) = ni A tgtQ{e) = 712}. In the notation 
above, we omit "G G" whenever the multigraph G is clear from the context. 

We call in-degree (respectively out-degree) of a node n the cardinality of the set 
{e £ Eq I tgt(e) — n} (respectively {e £ Eq \ src(e) = n}). 

Given a multigraph G, a path tt is a non-empty sequence of nodes ni . . . such 
that, for each i £ {1, . . . , fc — 1}, there is either an edge — > rii+i £ G or an edge 
ni+i Hi G G. Nodes ni and Uk are the endpoints of tt, and we say that tt connects 
ni and n^. A multigraph is connected when all pairs of nodes are connected by at 
least one path. 

4-2 Abstract unification 

We need to find the abstract counterpart of mgu over ShLin'^, i.e., an operation 
mgu^ such that, if [S]u o [9]u: then 

mgnJ[S]u,S)omgn{[9]u,S) (3) 

for each 6 £ ISubst. Note that we are looking for an abstract counterpart to the 
mixed unification in where one of the two arguments is a plain substitution. In 
particular, we would like to find an operator which is the minimum element that 
satisfies the condition in ([31), i.e., the optimal abstract counterpart of mgu. Observe 
that, for a fixed U , the set of all the elements [S\u £ ShLin"^ is a complete lattice 
w.r.t. with the top element given by [pm{U)\u and the meet operator given by 

AJ[5,;]c/ !*£/}= ' 
for any family {[S'i][/ | i £ /} of elements of ShLin'^. Moreover, the relation o is 
meet-preserving on the left, since if [Si\u o [d]u for each i £ /, then /\^{[Si]u \ i £ 
/} o [9]u- Therefore, we may define the abstract mgu as follows 

mgnJ[S]u,5) = f\J[S']u, \ Ve.[S]u o>[e]u^ [3%, o mgnmu,S)} , 

where the definitions of o and mgu force U' to be UUvars{5). Note that this is just 
a tra nslation of the general definition of optimal operator in ( Cousot and Cousot 
1992byaJId it satisfies dsn . 

This definition is completely non-constructive. The rest of this section is devoted 
to providing a constructive characterization for mgu^^ ([S]u , S). We begin to char- 
acterize the operation of abstract unification by means of graph theoretic notions. 
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Definition 4-12 {Multiplicity of lu- sharing groups) 

The multiplicity of an w-sharing group B in a term t is defined as: 

x{B,t) = Y,occ{v,t)= J2 B{v) ■ occ{v,t) . 
veB velB]\ 

For instance, x(a^^yz^, r{x, y, f{x, y, z))) = 3- 2 + l- 2 + 4- l = 12. The meaning of 
the map x is made clear by the following proposition. 

Proposition 4-13 

Given a substitution 6, a variable v and a term t, we have that x(6'^^(w),i) = 
occ{v,9{t)). Moreover, given a set of variables U, when vars{t) C U, it holds that 

x{e-\v)\u,t) = occ{v,e{t)). 

Example 4-14 

Let B = xy'^z^ and 9 = {y/r{x,x), z/r{x,x,x)}, so that 9~^{x) = {xy'^z^}. Given 
t = s{x, z) we have 

occ{x,9{t)) = occ{x,s{x,r{x,x,x))) =4 , 

and 

X{B, t) = B(x)occ{x,t) + B{z)occ{z,t) ^ I ■ 1 + 'S ■ 1 = A . □ 

If [S]u o [9]u and we unify [9]u with 6, some of the w-sharing groups in S may 
be glued together to obtain a bigger resultant group. It happens that the gluing of 

the sharing groups during the unification of [9]u with a single binding substitution 
{x/t} may be represented by special labelled multigraphs which we call sharing 
graphs. 

Example 4-15 

Let S = {x'^,y} and U = {x. y}. Wc look for a representation of the unification 
process between any substitution 9 approximated by 5* and the binding x/r{y). We 
show that multigraphs can be easily used for this purpose. For instance, the sub- 
stitution 9 = {x/r{g{u,u,u))} is approximated by S. By unifying 9 with {x/r{y)} 
we obtain 5 = {x/r{g(u,u,u)),y/g{u,u,u)}. Note that any approximation of S on 
the variables {x, y} must include the sharing group x^y^ generated by the variable 
u. Thus, any correct approximation of the unification must also contain x^y^. 

We want to associate to any w-sharing group B in S a. special multigraph which 
represents the way the w-sharing groups in S have been merged in order to obtain 
B. The nodes of this multigraph are the w-sharing groups in S (possibly repeated 
any number of times). The following is a sharing graph for x/r{y) and S: 

( \^ 

y 

Jo 
•^1 



y 




y 

^0 
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where pedices and apices on a sharing group B are respectively the values of x(-B, x) 
and x{B,r{y)). For instance, since x(a;^,a;) = 3, then we put the pedice 3 on the 
node to mean that x is bound to a term containing 3 occurrences of the same 
variable. Symmetrically, since xi^'^ i '^{v)) = 0; then we put the apicc on the node 
X'''. The in-degree and the out-degree of the nodes reflect the values of apices and 
pedices. In this case, we have 3 out-going edges from x^ and no in-going edges. 
Moreover, the multigraph must be connected, in order to guarantee that we can 
use a single variable to form the sharing group x^y^. 

By summing the labels of all the nodes, namely, a;^ l±) y tbi y W y, we obtain the 
w-sharing group x^y^ which must appear in any correct approximation of the uni- 
fication. □ 

Given any labelled multigraph G, in the rest of the paper we assume that the 
codomain of the labelling function Iq is pm(V), the set of w-sharing groups. 

Definition 4-16 {Sharing Graph) 

A sharing graph for the binding x/t and a set of w-sharing groups 5 is a labelled 
multigraph G such that 

1. G is connected; 

2. for each node n G Nq, lain) E S; 

3. for each node n € Nq, the out-degree of n is equal to x{lG{n),x) and the 
in-degree of n is equal to x(^g(^), t). 

The resultant to -sharing group of G is 

res(G) = [+) lain) . 

neNc 



Example 4.11 

Let S = {ux^,xy,vz,wz,xyz}. The following is a sharing graph for x/r{y,z) and 
S: 




V h y Jo 

where pedices and apices on a sharing group B are respectively the value of x{B, x) 
and x{B, r{y, z)). Therefore the resultant sharing group is uvwx'^y'^z^ . □ 

It is worth noting that, given any set of w-sharing groups S and binding x/t, 
there exist many different sharing graphs for x/t and S. Each sharing graph yields 
a resultant sharing group which must be included in the result of the abstract 
unification operator. Of course, different sharing graphs may give the same resultant 
sharing group. The abstract unification operator is defined by collecting all the 
resultant sharing groups. 
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Definition ^.18 {Single binding unification) 

Let U € p/(V), S he & set of w-sharing groups with [S]u G ShLin", x/t be a 
binding, and vars{x/t) C U. The set of resultant cj-sharing groups for x/t and S is 

mgu^(5, — {res{G) | G is a sharing graph for S and x/t} . 

We hft mgUjj to an operation over ShLin"^. 

^g^uji[S]u,x/t) ^[mgu{S,x/t)]u . 

This is a particular case of the abstract unification operator, for single binding 
substitutions and vars{x/t) C U. 

Example 4-. 19 

Let S be as in Example 14. 171 The following is a sharing graph for x/r{y,y, z) and 
S: 

where pedices and apices on a sharing group B are respectively the value of x(i3, x) 
and x(i3, r{y, y, z)). Therefore ux^yz G mgu^(S', x/r{y, y, z)). Note that this sharing 
group can actually be generated by the substitution 6 = {x/r{vi,vi,V2), y/v2, z/vi-, 
u/vi,v/a,w/a} where a is a ground term. Let U = {u, w, w, x, y, z}. It is the case 
that [SJcfO [0]^ and mgu([6'];7, {x/r{y, y, z)}) performs exactly the variable ahasings 
depicted by the sharing graph. Actually mgVL{[0]u, {x/r{y,y, z)}) = [x/r(vi,vi,vi), 
y/vi, u/vi,v/a, w/a]u = [vh and r]^^{vi)\u = ux^yz. □ 

We give here an intuition of the way sharing graphs work. Assume given a set of 
w-sharing groups [S]u and a binding x/t with vars{x/t) C U . We want to compute 
[Tng\x^{S , X / t)]u ■ To this aim, for any substitution 6 approximated by [S]u-, that is, 
[S]u o [9]u, we compute a^{Tiign{[9]u , {x/t})). 

For any Bi, B2 £ S, assume that there exist ui, i;2 G V such that Bi — 0~^{vi)\u 
and B2 = 6~'^{v2)\u- When unifying 9 with the binding x/t, we use the fact that 
mgu(Eq(6') U {x = t}) = mgu({6'(a;) = 0{t)}) o 9. By Prop. [4T3l 9{x) contains 
x{Bi,x) instances of vi and x{B2,x) instances of V2- Symmetrically, 9{t) contains 
x{Bi,t) instances of vi and x{B2,t) instances of V2. 

Assume that 9{x) and 9{t) only differ for the variables occurring in them (and 
not for the structure of terms). Then, an arrow from the sharing group Bi to B2 
represents the fact that, in mgu{{9{x) — 9{t)}), one of the copies of vi is aliased 
to one of the copies of V2, i.e., that there are corresponding positions in 9{x) and 
9{t) where the two terms contain the variables vi and V2 respectively. The third 
condition for sharing graphs implies that each occurrence of vi and V2 is aliased 
to some other variable. The first condition (the sharing graph must be connected) 
ensures that all the variables corresponding to the w-sharing groups involved in the 
sharing graph are aliased to each other. In other words, given any two such variables, 
they are aliased. Although here we are only considering the case when 9{x) and 9{t) 
differ for the variables occurring in them, we will show that it is enough to reach 
correctness and optimality. The next example applies this intuition to a concrete 
case. 
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Example 4-. 20 

Consider Example 14.191 where 6 — {x /r{vi,vi,V2),y Iv2, z /v2,u/vi,v / a,w / a} and 
U — {u, w, a;, z}. Let Bi = ux"^ and B2 = xyz, thus Bi = 0^^{vi)\u and 
B2 — d^^{v2)\u- When unifying 9 with the binding x/r{y, y, z) we have that 0{x) = 
r{vi,Vi,V2) and 0{r{y,y,z)) = r{v2,V2,V2). 

Note that 9(x) contains x(ux^, x) = 2 instances of vi and x{xyz, x) — 1 instance 
of V2- Symmetrically, 9{r{y, y, z)) contains r(j/, y, z)) = instances of vi and 

xixyz,r(jj,y, z)) = 3 instances of V2. Moreover, 9{x) and 9{r{y,y, z)) only differ 
for the variables occurring in them. Thus, the three edges in the sharing graph of 
Example 14. 191 correspond to the following aliasings: 

d{r{y,y,z))=r(v2,V2,V2) 

\ t f 

9{x) ^r{vi,vi,V2) 

In particular, the last arrow from V2 to itsself, corresponds to the self- loop in the 
sharing graph. □ 

The unification operator mgu^([S'][/, x/t) can be extended to the case vars{x/t) ^ 
U . The idea is to enlarge S by including all the singletons in vars{x/t) \ U . 

Definition 4-21 {Single binding unification with extension) 

Let U G p/(V), 5 be a set of cj-sharing groups with [S]u G ShLin'^ and x/t be a 
binding. 

^E^^c^{[S]u,x/t) = mgu^([5 U | v G vars{x/t) \ U}]uuvars{x/t),x/t) . 

Note that, for a generic abstract domain, the method of extending the abstract 
object to include all the variables in the concrete substitution i5 may result in a 
non-optimal abstract unification. For example, this is what happens in the case of 
the domain Sharing, as shown in (j Amato and Scozzari 2009)) . However, we will 
prove that, in the case of ShLin'^, the abstract mgu in Definition 14.211 is optimal. 

This operator can be extended to multi-binding substitutions in the obvious way, 
namely by iterating the single binding operator. 

Definition 4. 22 (Multi-binding unification) 

We define mgu^{[S]u,S) with S G ISubst and [S]u G ShLin'^ by induction on the 
number of bindings: 

mgUt^(['5']c/,e) = [S]u , 
^g^J[S]u,{x/t}iSS) ^mguJmguJ[S]u,x/t),5) . 

It is possible to prove that mgu^([iS'][/, (5) is optimal for multi- binding substitutions 
(j Amato and Scozzari 2005]) . Since optimality of iterative multi-binding unification 
is not inherited by the abstractions of ShLin" (as we show in Section [5.3p . we 
will focus on single binding unification. In the rest of the paper, we only consider 
bindings x/t which are idempotent, namely, such that x ^ vars{t). 
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We now show that mgu^{[S]u,S) is correct w.r.t. concrete unification. We show 
correctness for multi-binding substitutions, since it is a trivial extension of the 
single binding case. In fact, composition of correct operators is still correct. 

First of all, we extend the definition of to the case when it is applied to a 

sharing group B. 

Definition 4-. 23 

Given 6 S ISubst and B an w-sharing group, we define 

e-\B) = xvGV.x{B,e{v)) . 

In order to prove the correctness of abstract unification, we need the following 
auxiliary property. 

Proposition 4-^4 

Given substitutions 9, r] € ISubst and an w-sharing group B, we have 

{voe)-\B) = 0-\v-'m . 



Theorem 4-25 (Correctness o/mgu^) 

The operation mgu^ is correct w.r.t. mgu, i.e. 

y[S]u e ShLin",(5 G ISubst. [S]u o [9]u = 



mgu^([5'][/, (5) o mgu([6i](7, (5) 



Example 4-26 

Let e = {x/r{s{u,u,u),v,w),y/v',z/w'}, 5 = {x/r{y,y,z)} and U = {x,y,z}. 
Therefore ai^{[9]u) = [x^ ,x,y, z]u- If we proceed with the concrete unification 
of [6]u with S, we have mgu{[9]u,S) = [9']u with 9' — mgu(^, 5) = r} o 9 and 
?7 = mgu(^(a;) = 9{r{y,y,z))). This gives the following results: 

r] = {v' /s{u, u, u),v/ s(u, u, u), w' /w} , 
9' = {x/r{s{u, u, u), s(u, m, u), w), y/s{u, u, u), z/w, v'/s{u, u, u),w' /w} , 

with [9']u = [9]u . Now, lot 77' be obtained from r] by replacing each occurrence of a 
variable in rng(r7) with a different fresh variable, P = rj' o9 and p be a substitution 
mapping variables to variables such that p{(3{x)) = 9'{x) for each x gU. Note that 
p is not a renaming, since it is not bijective. We have: 

V = {v/s{ui,U2,U3),v' /s{U4,U5,Ue),w' /uj} , 

P = {x/r{s{u,u,u),s{ui,U2,U3),w),y/s{u4,U5,ue), z/ur,v' /s{u4,U5,U6),w' /ur} , 
p = {ui/u, U2/U, U3/U, U4/U, ur,/u, uq/u, ur/w} . 

Following the proof, we build a multigraph G as follows: 
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Note that we have chosen to annotate every sharing group with the corresponding 
variable in vars{P{U)). This is not a sharing graph since it is not connected, but if 
we take Y = ^p~^{u)^ = {u, ui, 1*2, U3, U4, U5, uq}, the restriction of G to the nodes 
annotated with a variable in F is a sharing graph whose resultant w-sharing group 
is x^y^. □ 



4-4 Optimality of Abstract Unification 

We now prove that mgu^^ is not only correct, but also optimal for a single binding 
substitution, i.e., it is the least correct abstraction. This means proving that, given 
a set of a;-sharing groups [S]u G ShLin'^, a binding x/t, and an tj-sharing group 
B G mgu.^^{[S]u,x/t), there exists a substitution [S]u such that [S]u o [S]u and 
B s aoj{nigVL{[S\u,{x/t})). First of all, we prove optimality of mgu^^{[S]u,x/t) in 
the special case of vars{x/t) C U. Next, we extend this result to the general case. 

Example 4. 27 

Consider S = {xu, xv, y} and the binding x/s{y, y). The following is a sharing graph 
for x/s{y,y) and S whose resultant w-sharing group is x^uvy. 




We show how to find a substitution [S\u such that the w-sharing group x^uvy € 
ai^{mgn{[S]u,{x/s{y,y)})). Let U = {u,v, x,y}. For each node n of the sharing 
graph, we consider a different fresh variable Wn- Assume that the node labelled 
with xu in the upper- left corner is node 1, and proceed clockwise to number the 
other nodes. 

For each variable z G U \ {x}, we associate to S{z) a term containing all the 
variables Wi such that the label of the i-th node contains the variable z. Thus, we 
define 5{u) = r{wi) where wi correspond to the node containing u. Analogously, 
we define 5{v) ~ r{w2) and S{y) — r{w^). 

We now define d{x) in a different way, namely by replacing in s{y,y) each oc- 
currence of the variable y with a term similar to 6{y), with the difference that W3 
is replaced with the variables wi and u>2. The choice of u>i and W2 is obvious by 
looking at the sharing graph, since the first and second node are the sources of the 
two edges targeted at the node three. Therefore we obtain S{x) = s{r{wi), r{w2))- 

Summing up, we have: 

5 = {u/r{wi),v/r{w2),x/s{r{wi),r{w2)),y/r{w3)} . 
It is easy to check that [S]u o [S]u and 

mgu{S,{x/s{y,y)}) = 

{u/r{wi),v/r{wi),x/s{r{wi),r{wi)),y/r{wi),W2/wi,W3/wi} , 
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hence a(^([mgu((5, {x/s{y, y)})]u) = [x^uvylu- □ 

In the above example we have shown how to find a special substitution such that 
its fresh variables are unified according to the arrows in a sharing graph. The same 
idea is exploited in the next theorem for proving the optimality of the abstract uni- 
fication operator mgu^. For any w-sharing group X G Tiigu^{[S]u , x / 1) , we provide 
a substitution 8 obtained as in Example l4.271 such that [S]u approximates [S\ij and 
X e a^(mgu([(5]c/,{a;/i)}). 

Theorem ^.28 [Optimality o/mgu^) 

The single binding unification mgu^([S'](7, x/t) is optimal w.r.t. mgu, under the 
assumption that vars{x/t) C C/, i.e., : 

VB e uign^[[S]u,x/t) 35 e ISubst. [S]u o [5]u and B e a^(mgu([(5][/, {x/t})) . 

The previous proof requires that vars{x/t) C U. However, the same construction 
also works when this condition does not hold. 

Example 4-29 

Given U — {x, y} and S — {x^ , x^y}, we want to compute mgu^^ ( [5] , x/s{y,z)). By 
extending the domain of the variables of interests, we obtain [S']v ~ [x'^, x^y, z]x,y^z- 
One of the sharing graphs for x/s{y, z) and [S']v is 
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Following the proof of the previous theorem, we obtain the substitution 

= {x I s{r{wi),r{wx,W2,W2)),y Ir{w2),z lr{wi,w^,w^)} , 

where [S"]yo[(5']y and a:^?/z'^ € Q!a;(mgu([(5']y, {a:/s(i/, z)})). However we are looking 
for a substitution i5 such that \S\i! o [i5];7 and x^yz^ g Q;,^(mgu([<5]c/, {a;/s(?/, z)})). 
Nonetheless, we may choose (5 = (5' (or, if we prefer, 8 — 5'\x^y) to get the required 
substitution. □ 

This is not a fortuitous coincidence. We may show that it consistently happens 
every time we apply Theorem l4.28l to an abstract unification where vars{x/t) ^ U. 
Therefore, we can prove the main result of the paper. 

Theorem 4-30 [Optimality of mgri^ with extension) 

The single binding unification mgu^ with extension is optimal w.r.t. mgu. 

4-5 A characterization for resultant sharing groups 

The domain ShLin'^ has not been designed to be directly implemented, but some 
of its abstractions could. Providing a simpler definition for the set of resultant lu- 
sharing groups could help in developing the abstract operators for its abstractions. 
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We show that given a set S of w-sharing groups and a binding x/t, the set of 
resultant w-sharing groups has an elegant algebraic characterization. 

By definition of sharing graph, a set of nodes N labelled with w-sharing groups 
of S can be turned into a sharing graph for S and x/t if and only if the condition on 
the out-degree and in-degree is satisfied and the obtained graph is connected. The 
condition on the degrees says that for each node s labelled with the sharing group 
Bs, the out-degree of s must be equal to xi^s^x). Symmetrically, the in-degree 
must be equal to x{Bs,t)- As a consequence, the sum of the out-degrees of all the 
nodes J2seN x{Bs,x) must be equal to the sum of the in-degrees of all the nodes 
X^sgjv xi^s, t). This is because each edge has a source and a target node. Moreover, 
in order to be connected, any graph needs at least |A^| — 1 edges. Since the number 
of edges is equal to the sum of in-degrees of all the nodes, it turns out that such 
a sum must be equal to or greater than |A^| — 1. Surprisingly, this is enough to 
construct a sharing graph from N. 

Theorem 4.31 

Let 5' be a set of w-sharing groups and .x/t be a binding. Then B G mgu^{S,x/t) 
iff there exist n £ N+, Bi,. . . ,Bn ^ S which satisfy the following conditions: 

1. B = l±)i<i<„i3i; 

2- Ei<i<n x{Bi, x) = Ei<K„ x{Bi,t) > n - 1; 

3. either n = 1 or VI < i < n. x{Bi, x) + x{Bi,t) > 0. 

Following the above theorem, we can give an algebraic characterization of the 
abstract unification operator as follows. 

Corollary 4-32 {Algebraic characterization o/mgu^^) 

Given a set of w-sharing groups S and a binding x/t, we have that 

mgu„(5, x/t) = {S\rel{S,x,t))U 

I W7e I 7e G pm{rel{S,x,t)), ^ x{B,x) = x{B,t) >\Tl\-l[, 
I Ben Ben J 

where 

rel{S, X, t) = {BG S. x{B, x) + x{B, t) > 0} 
= {BgS. [[BJJ n vars{x/t) 0} . 

Example 4.33 

Consider S = {xa,xb,z'^,zc} and the equation x = z. Then if we choose TZ = 

§xa, a;6, wc have J2BenX{B,x) = 2 = J2BenX{B, z) > \TZ\ — 1. Therefore 
x^z^ab G mgu^(5, x/z). If wc take TZ = \^xa, xb, zc, zc^, although YliBenX{B,x) = 
2 = X^BeK x{B, z), we have \TZ\ — 1 = 3. This only proves that z^c^x^ab cannot be 
obtained by the multiset TZ. If we check for every possible multiset over S, we have 
that z'^c^x'^ab ^ mgu^^(S',a;/2:). □ 

This characterization of the abstract mgu will be the key point for devising the 
optimal abstract unification operators on the abstractions of ShLin'^ . Let a be the 
abstraction function from ISubstr^ to an abstract domain A. If we are able to factor 
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a through a Galois connection (a' : ShLin'^ ^ A, 7' : A ShLin'^) as a = a' o a^, 
then the optimal abstract unification for a is exactly Q;'(mgU(^(7'(-), •)). However, 
this expression is helpful when it may be simplified in order to use only objects in 
A. Our algebraic characterization makes the simplification feasible, as we show in 
the following section. 

5 Practical domains for program analysis 

We consider two domains for sharing analysis with linearity information, namely, 
the domain proposed in ( |King 1994] ) and the classical reduced product Sharing x 
Lin. They are defined as abstractions of ShLin'^ through Galois insertions. This 
allows us to design optimal abstract operators for both of them, by exploiting the 
results introduced so far. By composing each Galois insertion with a^,, we get the 
corresponding abstraction function for substitutions (jCousot and Cousot 1992a| 
Sect. 4.2.3.1). 

5. 1 King 's domain for linearity and aliasing 

We first consider the domain for combined analysis of sharing and linearity in ^ King 
1994^ The idea is to enhance the domain Sharing by annotating each sharing 
group with linearity information on each variable. For instance, the object xy°° z 
represents the sharing group xyz and the information that y may be non-linear 
(while X and z are definitely linear). The objects in this domain can be easily 
viewed as abstraction of w-sharing groups. Intuitively, in order to abstract an u- 
sharing groups, one simply needs to replace each exponent equal to or greater than 
2 with 00. Let us now formalize the domain as an abstraction of ShLin'^. 

An oj-sharing group (which is a multiset V — > N whose support is finite) is ab- 
stracted into a map o : V ^{0, 1, 00} such that its support [[ojj = {v E V \ o{v) ^ 0} 
is finite. We call such a map the 2-sharing group. We use a polynomial notation 
for 2-sharing groups as for w-sharing groups. For instance, o = xy°° z denotes the 
2-sharing group whose support is [[ojj = {a;,y, z}, such that o(x) = o(z) = 1 and 
o(?/) = 00. We denote with the 2-sharing group with empty support. Note that in 
( [King 1994] ) the number 2 is used as an exponent instead of 00, but we prefer this 
notation to be coherent with w-sharing groups. 

We denote min{o(a;),2} by Om(a;), where the ordering on N is extended in the 
obvious way, i.e., for all n € N we have that n < 00. A 2-sharing group o represents 
the sets 72(0) of w-sharing group given by: 

72(0) = {Be PraiV) I M = A V.X e M-oM < b{x) < o(x)} . 

For instance, the 2-sharing group xy°° z represents the set of w-sharing groups 
{xy"^ z , xy^ z , xy'^ z , xy^ z , . . .}. The idea is to use 2-sharing groups to keep track of 
linearity: if o(x) — 00, it means that the variable x is not linear in the 2-sharing 
group o. In the rest of this section, we use the term "sharing group" as a short form 
of 2-sharing group, when this does not cause ambiguity. 

An w-sharing group B may be abstracted into the 2-sharing group a2{B) given 
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by: 

I oo otherwise. 

The next proposition shows two useful properties of the maps 0:2 and 72. 



Proposition 5.1 

The following properties hold: 

1. aad+lTe) =l+)a2(7e). 

2. rel{j2{S),x,t)) =j2irel{S,x,t)). 

Since we do not want to represent definite non-linearity, we introduce an order 
relation over sharing groups as follows: 

o < o' ^ = lo'W A Va; e H- o{x) < o'{x) , 

and wc restrict our attention to downward closed sets of sharing groups. We denote 
by Sg'^{V) the set of 2-sharing groups whose support is a subset of V. The domain 
we are interested in is the following: 

ShLin^ = {[S]u I S e Pi{Sg\U)),Ue p/(V),5 ^ ^ G 5} , 

where pi{Sg'^{U)) is the powersot of downward closed subsets of Sg^{U) accord- 
ing to < and [Si]ui ^2 [S2]u2 iff C^i = U2 and C 82- For instance, the set 
{xy°°z} is not downward closed, while {xyz,xy°°z} is downward closed. There 
is a Galois insertion of ShLin^ into ShLin'^ given by the pair of adjoint maps 
72 : ShLin^ ^ ShLin'^ and 02 : ShLin'^ ShLin^: 

l2i[S]u)=[\jMo)\o€S}]^ , 

M[S]u) = [i{a2{B) I B e S}]^ . 

With an abuse of notation, wc also apply 72 and 012 to subsets of tj-sharing groups 
and 2-sharing groups respectively, by ignoring the set of variables of interest. For 
instance, 72({a;yz, xy°°z}) = {xyz, xy'^z, xy^z, xy^z, xy^z^ . . .}. 

Theorem 5.2 

The pair (02,72) is a Galois insertion. 

Now we may define the optimal mgu for ShLin^ and single binding substitutions 
as follows: 

Definition 5.3 {Unification for ShLin^) 

Given [S]u € ShLin^ and the binding x/t, we define 

mgU2 {[S\u-,x/t) = a2 (mgu^ (72 ( [S] c/ ) , ) ) . 

By construction, mgU2 is the optimal abstraction of mgu^^, hence also of mgu. In 
the case where vars{x/t) C U, by using additivity of 02 we get: 

mgU2{[S]u,x/t) = \a2{l2{S) \ rel{-f2iS),x,t))U 
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a2({W7^ I 7^e p„(re/(72(5),a;,i)), 

Y,x{B,x)^Y.^^B,t)>\n\-l})\ . (4) 



B&n Ben 

Now we want to simplify Eq. |4l In particular we would like to get rid of the 
abstraction and concretization maps and to express the result using only objects 
and operators in ShLin^. Therefore, we need to define operations in ShLin^ which 
correspond to W and x iii ShLin'^. 

The operation on 2-sharing groups which corresponds to multiset union on uj- 
sharing groups, is given by 

o l+l o' = Aw e V.o{v) ® o'{v) , 

where 0®a; = x®0 — x and oo®x — x^oo — 1®1 — oo. We will use l+)-8^oi, . . . , o„5- 
for oitt)- • -Won. Given a sharing group o, we also define the delinearization operator: 

= o W o . (5) 

Note that = Aa; e [[ojj.oo. The operator is extended pointwise to sets and 
multisets. 

A fundamental role is played by the notion of multiplicity of a sharing group in 
a term. While the multiplicity of an tj-sharing group in a term is a single natural 
number, every object in ShLin^ represents a set of w-sharing groups, hence its 
multiplicity should be a set of natural numbers. Actually, it is enough to consider 
intervals. We define the minimum Xm and maximum xm multiplicity of o in t as 
follows: 

Xfn{o,t) ^ ^ Om{v) ■ OCc{v,t) XM^O.t) ^ ^ o{v) ■ OCc{v ,t) . 

velo\i t)G|[oJl 

Sum and product on integers are lifted in the obvious way, namely, the sum is oo if 
and only if at least one of the addenda is oo and n • oo = cxd • n = oo for any n G N"*" , 
while • oo = 00 ■ = 0. The maximum multiplicity xm(o, t) either is equal to the 
minimum multiplicity Xm{o^ t) or it is infinite. Note that, if B is an w-sharing group 
represented by o, i.e., B e 72(0), then Xm(o,t) < x{B,t) < XM{o,t). Actually, not 
all the values between Xm(o, and xm(o, t) may be assumed by x(_B,t). 

Example 5.4 

Let o = and t = /(x, x). According, to our definition, x(o, t) = [4, 00). However, 
it is obvious that if _B e 72(0), then xiB,t) is an even number. □ 

According to the above definitions, we define the multiplicity of a multiset of 
sharing groups as 

x{Y,t)^{neN\ Y.Xm{o,t)<n<J2xM{o,t)} . 
oeY oeY 

Even if this is a superset of all the possible values which can be obtained by com- 
bining the multiplicities of all the sharing groups in Y, this definition is sufficiently 
accurate to allow us to design the optimal abstract unification. 
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We extend in the obvious way the definition of rel (see Corollary I4.32p from 
w-sharing groups to 2-sharing groups, i.e., 

rel{S, i) = {o e 5 I [[oJJ n vars{x/t) ^ 0} , 

and we prove the following 

Theorem 5.5 {Characterization of abstract unification for ShLin^) 

Given [S]u G ShLin'^ and the binding x/t with vars{x/t) C U , we have that 

uign^{[S]u,x/t)^[{S\S')\J 

i{[+Jy I y e p™(5'),n e x{Y,x)r\x{Y,t). n>\Y\~ l]]u , 

where S" = rel{S,x,t). 

Example 5.6 

Let S = i{0,ux°°,w°°,x°°2/,z°°} and Y = 1^°°, xy, z°°§. We have that 
x{Y,x) = {n I n > 5} and x(F, /(z,z)) — {n \ n > 4^}. Since f{z,z) contains 
two occurrences of z, the "actual" multiplicity of the sharing group z°° in /(z, z) 
should be a multiple of 2. But wc do not need to check this condition and can 
safely approximate this set with {n \ n > 4}. This works because we can always 
choose a number which is contained in both x) and x(^i s-^i^ which is an 
"actual" multiplicity. For instance, we can take n = 6 G x(X^ ^ z)) and 

since we have 6 > 3 = |F| — 1, we get that the sharing group 1+) F = uvx°°yz°° 
belongs to mgU2([5]c/, a;//(z, z)). This sharing group can be generated by the sub- 
stitution {x / f {f {u, u, y) , f (v , V , y)) , z / f (w , w , w)} when the variables of interest are 
{u,v,x,y,z}. □ 

Theorem 15.51 gives a characterization of the abstract unification over ShLin^. 
However, it cannot be directly implemented, since one needs to check a certain 
condition for each element of p,„(re/(5, x, t)), which is an infinite set. Nonetheless, 
this is an important starting point to prove correctness and completeness of the 
abstract unification algorithm which wc are going to introduce. 

The characterization in Theorem 15.51 mav be used even when vars{x/t) ^ U, if 
we first enlarge the set of variables of interest in order to include all vars{x/t). 

Theorem 5.7 (Characterization of abstract unification with extension for ShLin^) 
Given [S]u in ShLin^ and the binding x/t, let V — {ui, . . . ,w„} be vars{x/t) \ U. 
Then, 

mgU2([S']t/,a:;/t) = mgU2([S' U {vi, . . . ^VnWuvjv.x/t) . 

The previous theorem states that enlarging the set of variables of interest preserves 
optimality. 

5.2 An algorithm for abstract unification in ShLin^ 

In order to obtain an algorithm from the characterization in Theorem 15. 51 we need 
to avoid the use of pm(reZ(5', a;, f)) and to develop a procedure able to compute 
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the resultant sharing groups by inspecting subsets (not multisets!) of rel{S, x,t). In 
general, any X C rel{S, x, t) yields more than one sharing group, since every element 
in X may be considered more than once. However, since ShLin^ is downward closed, 
it is enough to compute the maximal resultant sharing groups. 

Given X C rel{S,x,t) and the binding x/t, assume that we are only interested 
in those sharing groups whose support is [[l+l-'^Jl- By joining (multiple copies of) 
the sharing groups in X, any resultant sharing group o is between 1+J X and 1+J X"^ , 
i.e., < o < where X"^ is the pointwise extension of the delinearization 

operator (see Eq. [5|). Note that, if X is badly chosen, it is possible that we are 
not able to generate any sharing group with this support. In this computation, the 
notion of multiplicity of a sharing group in a term plays a major role. 

For example, given the binding x/t, if xm(o, x) <1 for each o ^ X, then 1+J X is 
a resultant sharing group only if there is a unique sharing group o d X such that 
vars{t) n [[oJJ ^ 0. If there are 01,02 G X such that xm{oi,x) > 1 and xm(o2, t) > 1 
then 1+J X is a resultant sharing group. Moreover, we may join two copies of each 
sharing group in X, and therefore also 1+jX^ is a resultant sharing group. 

Now we can define the notions of linearity and non-linearity on the abstract 
domain. In addition, we also introduce a new notion of strong non-linearity. Given 
X C rel{S,x,t), we partition X in three subsets X^ = {o & X \ XM{o,t) — 0}, 
Xt^{oeX\ xm(o, x) = 0} and X^t = X \ {X, U Xt). 

Definition 5.8 

Given a set S of sharing groups and X C rel{S, x, t), we say that X is: 

• linear for the term t if for all o G X it holds that xa/(o, t) < 1; 

• non-linear for the term t if there exists o G X such that XM{o,t) > 1; 

• strongly non-linear for the term t if there exists a £ X such that xm(o, t) — 00 
or there exists o G X^t such that xm(o, t) > 1. 

Analogously, we define linearity and non-linearity of X for the variable x. 

Note that, if t is a variable, the non-linear and strongly non-linear cases coincide. 
We now present the algorithm for computing the abstract unification in ShLin^. 

Theorem 5.9 (Abstract unification algorithm for ShLin^) 

Given [S]u G ShLin^ and the binding x/t with vars{x/t) C U , we have 

^gn^{[S]u,x/t) = [{S\S')\Ji y res{X,x,t)]u , 

X<ZS' 

where S' = rel{S,x,t) and res{X,x,t) is defined as follows: 

1. if X is non-linear for x and t, then res{X, x, t) = {1+J X^}; 

2. if X is non-linear for x and linear for t, \Xx\ < 1 and \Xt\ > 1, then we have 
res{X,x,t) = {(1+jX.) W (I+JX^J W (J+JX^)}; 

3. if X is linear for x and strongly non-linear for t, \Xx\ > 1 and \Xt \ < 1, then 
we have res(X, x, i) = {(l±l ^J) W (l±l X.^) W (1+J Xt)}; 
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4. if X is linear for x and not strongly non-linear for t, \Xt \ < 1, then we have 

res{X,x,t) = {{\^Z)^{\^Xl,)^{\^Xt) \ Z e p^X.), 

\Z\^XM{Xut)^Xn^{Xut), 

m = XA ; 

5. otherwise res{X,x,t) — 0. 



Example 5.10 

Let U = {u, V, X, y} and consider the set of 2-sharing groups S = {0, xu, x°°,xy, yv} 
and consider the binding x/r{y, y). Note that S' — {xu, x°°,xy, yv}. Let us compute 
res{X,x,r{y,y)) for some X's, subsets of 5". 

• X — {x°°,yv}. In this case, xm{x°° ,x) = oo and XAf(yf , r(?/, y) = 2, hence 
X is non-linear for x and r{y, y). From the first case of Theorem l5.9l we have 
that res{X,x,r{y,y)) = {l+JX^} = {1+J{a;°°, y°°w°°} = {x°°y°°v°°}; 

• X — {xu, xy, yv}. Then X is linear for x and strongly non-linear for r{y, y), 
since xy G X^t and XM{xy,r{y,y)) = 2. From the third case, it follows that 
resiX, X, t) - {{{^{xu}-") W (l±|{xy}2) y {{^{yv})} = {x^y^u^v}; 

• X = {xu,yv}. Then X is linear for x and not strongly non-linear for r(y,y) 
(note that XM{yv,r{y,y)) = 2 > 1 and yv G Xt, hence X is non-linear for 
r{y,y) but it is not strongly non-linear). Since XM{Xt,r{y,y)) = 2, we only 
need to consider those Z G pm{Xx) such that \Z\ = 2. There is only one such 
set, which is Z = ^xu,xu^. Therefore res{X,x,r{y,y)) — {(\^^xu,xu^) W 
(1+J{}2) W (1+J{yi,})} = {x°°yu-«}. □ 

Note that, given X C S' , if x does not appear in any sharing group of S, then 
res{X,x,t) C {0}. In fact, we can only apply the fourth or fifth case. In the fourth 
case, we have that Xx = Xxt — 0, and thus the only Z G pm{Xx) is the empty 
multiset. Thus, \Z\ = 0, which implies that Xt — 0, and res{X,x,t) = {$}. In 
the fifth case, the result is trivially the emptyset. Symmetrically, when none of the 
variables of t appears in S, again we can apply only the fourth or fifth case, and 
res{X,x,t) C {0}. 



Example 5.11 

Consider S and U as in Example lS.lOl We compute mgU2([5']c/, x/r{y, y)). We show 
the value of res{X,x,r{y,y)) for every X C S' = rel{S,x,r{y,y)) which contains 
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X 


res(S,x,r(y,y)) 


case 


x°° , xy 


x°°y°° 


1 


x°° ,yv 


x°°y°°v°° 


1 


x°°,xy,yv 


x°°y°°v°° 


1 


x°° ,xu, xy 


x°°y°°u°° 


1 


x°° ,xu, yv 


x°°y°°u°°v°° 


1 


x°° ,xu, xy, yv 


x°°y°°u°°v°° 


1 


xu, xy 


x°°y°°u°° 


3 


xu, yv 


x^yu°°v 


4 


xu, xy, yv 


x°°y°°u°°v 


3 



Hence 

mgU2(['S']c/,a:/r(y,2/)) = 

i{0, x°°y°°v°°,x°°y°°u°°, x°° y°° u°° , x°°yu°°v, x°°y°°u°°v} □ 



The main difference between the algorithm in Theorem 15.91 and the characteriza- 
tion in Theorem l5.5l is that in the former X is a subset of 5" while, in Theorem l5.5[ Y 
is a multiset over 5". Since the number of subsets of S' is finite, the characterization 
in Theorem 15.91 is an algorithm. 

Obviously, a direct implementation of mguj would be very slow, so that appropri- 
ate data structures and procedures should be developed for a real implementation. 
Although this is mostly out of the scope of this paper, we show here that the 
definition of mgU2{[S]u,x/t) may be modified to consider only maximal subsets 
of rel{S, x,t). This should help in reducing the computational complexity of the 
abstract operator. 

Given [A]ij £ ShLin^, let max A be the set of maximal elements of A, i.e., 
max A = {a A \ $b £ A.b >2 a}. Given a sharing group o, we define the lin- 
earized version of o, denoted by l{o), as 



l{o)(v) = 



1 if w e H 

otherwise 



The linearization operator I is extended pointwise to sets of sharing groups. We 
show that instead of choosing X as a subset of S' in the definition of mgug, we may 
only consider those X's which are subsets of maxS". 

Theorem 5.12 

Given [S\u G ShLin^ and the binding x/t with vars{x/t) C U, we have 

mgn2{[S]u,x/t)^[iS\S')Ul [j {res{X,x,t)Ures'{X,x,t))]u , 



XCmax S' 



where S' = rel{S,x,t) and 



res'{X, X, t) 



{1+J if X = X^t and 1{X) is linear for t , 

otherwise . 
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The next examples compare our optimal abstract unification operator to the 
original one and show the increase in precision. 

Example 5.13 

Let U ~ {u, V, w, X, y}. Consider the set of 2-sharing groups S — {0, xu, xv, xw, y}. 
We compute mgU2([S']£/, a;/r(7/, y)). Since rel{S, x,r{y,y)) — S, we need to con- 
sider any X C S. If y ^ X then clearly res{X, x,r{y,y)) — ^. li y G X, since 
XMiy,r{y,y)) — 2, it follows that X is linear for x and not strongly non-linear for 
r{y,y)- Thus 

mgU2(['S']c/, x/r{y, y)) = [i{0, x°°u°°y, x°°uvy, x^uwy, x°°vwy, x°°w°°y}]u 

On the other hand, computing with the unification algorithm given in ( [King 1994[ ), 
the result is 

i{0, x^u°°y, x°°u°°w°°y, 

x°°v°°y,x°°v°°w°°y,x°°w°°y,x°°u°°v°°w°°y} . 

The old algorithm is not able to infer the linearity which arises when combining two 
distinct sharing groups from {xu,xv,xw} with {y}. Moreover, it does not assert 
that the variables u, v, w cannot share a common variable. □ 

Example 5.14 

Let U — {u,x,y,z} and 5* = {0, xu, xy, yz}. By computing mgU2([5];7, a;/r(y)) 
we obtain l{$,x°°y°°,x°°uy°^z}, which shows that u and z are linear after the 
unification. This is not the case when computing with the unification algorithm in 
( jKing 1994( ), since we obtain i{0, x°°y°° , x°°u°^y°° z°° , x'^u°°y°°,x°°y°°z°°}. Note 
that, we also improve the groundness information. In fact, in our result, groundness 
of u implies groundness of z. □ 

Both examples show the increased precision w.r.t. King's algorithm. In the first 
example, we obtain optimality thanks to the introduction of the notion of (non-) 
strong non-linearity. In the second example, we improve the result since we do 
not need to consider independence between x and t, in order to exploit linearity 
information. 

5.3 Unification for multi-binding substitutions 

The unification operator on ShLin^ has been defined for single binding substitu- 
tions. It is possible to extend this operator to multi-binding substitutions in the 
obvious way, namely by iterating the single binding operators. 

mgU2([^]c/,Wt}W0) = mgU2(mgU2([5]t/,a;A),e) ■ 

However, defined in such a way, mgU2 is not optimal. Consider, for example, S — 
{^jXZjyw}, U = {x,y,z,w}, and the substitution 9 — {x/r{y,y), z/w}. We have 
that mgU2([S'](7, a;/r{y, 2/)) = [| {(l>,x°°z°°yw}]u. Since x°°zyw <2 x°°z°°yw, by 
applying the third case of mgU2 ioY — {x°^zyw} we get 

mgU2(U {%,z°-x°-yw}]u,z/w) = [i {^,x^y^ z°-w°-}]u . 
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However, 

"2 (mgu^ (72 ( [{0 , 0:2: , y w}] £/ , 6») ) ) 

= a2{T!\gM^{[{xz,yw}]u,0)) 

= a2{v[vg\x^{[{wx^yz^}]u,{zlw})) 

= c^2m]u) = mu , 

which shows that nigU2 is not optimal. Note that, we do not use optimahty of mgu^^ 
to prove this result, since correctness is enough. 

The problem is that, to be able to conclude that the unification of S with $ is 
ground, wc need to keep track of the fact that, after the first binding, w is linear 
and z is definitively non-linear. Since ShLin^ is downward closed, we are not able 
to state this property. Note that, in the case we have presented here, by changing 
the order of the bindings we get an optimal result in ShLin^, but this happens just 
by accident. 

Now, consider the substitution = {x/r{y, ....,y), z/ s{y, ...,y),u/v} with S = 
{^,xu,zv,y} and U = {u,v,x,y,z}. Assume that r{y,....,y) is an n-ary term, 
s{y, y) is an m-ary term with n^m and n,m>2. We have that: 

mga2{[S]u,x/r{y,....,y)) = [mx°°u°°y, zv}]u , 
mgn2{[mx°°u°°y,zv}]u,z/s{y,...,y)) = [l{<D,x^u°°z^v^y}]u , 
mguMD, x'^u'^z°°v^y}]u,u/v) = [l{<!),x°°u'^z^v'^y}]u • 

On the other hand, we have that: 

a2(mgu^^(72([{0, xu, zv, y}]u, 9))) 
= a2{mgu^{[{xu,zv,y}]u,0)) 
= a2(mgUj^([{a;"u"2/, zv}]u, {z/s{y, y),u/v})) 
= a2{mgnMxVyzV}]u,{u/v})) 

= Mimu) = mu . 

However, if n = m, we have: 

a2(mgu^^(72([{0, xu, zv, y}]u, 9))) 
= MM}} U {a;'="u'="y'=.2'="t;'=" | k e N}]u) 
= [i{<D,x°°u°°z°°v°°y}]u . 

In this case, keeping track of the variables which are definitively non-linear does 
not help. It seems that, in order to compute abstract unification one binding at 
a time, we need to work in a domain which is able to keep track of the exact 
multiplicity of variables in a sharing group. Actually, this is how ShLin'^ works. 
Obviously, we could try to develop a different algorithm for unification in ShLin^ 
which directly works with multi-binding substitutions. However, since the algorithm 
for single binding substitutions is already quite complex, we think this is not worth 
the effort. 
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5.4 The domain Sharing x Lin 



The reduced product ShLin = Sharing x Lin has been used for a long time in the 
analysis of aUasing properties, since it was recognized that the precision of these 
analyses could be greatly improved by keeping track of the linear variables. Among 
the papers which consider the domain ShLin, we refer to (jHans and Winkler 1992|) 
and (|Hill et al. 2004p . Actually, these papers also deal with freeness properties, 
which we do not consider here, to further improve precision. Although the domain 
ShLin has been used for many years, the optimal unification operator is as yet 
unknown, even for a single binding substitution. We provide here a new abstract 
operator for ShLin, designed from the abstract unification for ShLin^, and we prove 
that it is optimal for single binding substitutions. 

The domain ShLin keeps track of linearity by recording, for each object of 
Sharing, the set of linear variables. Each element is now a triple: the first compo- 
nent is an object of Sharing, the second component is an object of Lin, that is, the 
set of variables which are linear in all the sharing groups of the first component, 
and the third component is the set of variables of interest. It is immediate that 
ShLin is an abstraction of ShLin^ (and thus of ShLin"). In the following, we briefly 
recall the definition of the abstract domain and provide the abstraction function 
from ShLin^. 

ShLin = {[S,L,U] \ S C p{U), (S* ^ ^ e S*), L D [/ \ vars{S), U G p/(V)} , 

with the approximation relation <si defined as [S, L, U] <si [S', L', U'] iS U = U', 
S C S' , L D L'. There is a Galois insertion of ShLin into ShLin'^ given by the pair 



where is the 2-sharing group which has the same support of B, with linear 
variables dictated by the set L. In formula: 



The functional composition of a^^, 02 and asi gives the standard abstraction 
map from substitutions to ShLin. We still use the polynomial notation to represent 
sharing groups, but now all the exponents are fixed to one. Note that the last 
component U in [S, L, U] is redundant since it can be retrieved as LU vars{S). This 
is because the set L contains all the ground variables. 



In order to obtain a correct and optimal abstract unification over ShLin, the trivial 
way is to directly compute asi(jn.gu.2{'ysii[S, L, U]), x/t)). However, we prefer to give 



of maps: 



asi{[S]u) = [{M I o e 5}, {x e C/ I Vo e 5. o{x) < 1}, U 
Jsi{[S,L,U])^[{Bl\B eS}]u , 




00 ifB eU\L, 

1 if B e L, 

otherwise. 



5.5 Abstract unification for Sharing x Lin 



On the interaction between sharing and linearity 



31 



an unification operator similar to the other operators for ShLin in the Hterature 
dHowe and King 2003[ [Bagnara et al. 2005[ IHill et al. 2004)) . As for the domain 



ShLin^, we now provide the notions of multiphcity and linearity over ShLin. 

Given a set L of linear variables, we define the maximum multiplicity of a sharing 
group o in a term t as follows: 

L( ,N _ /E„eo occ(w,t) iior\vars{t)CL 
I oo otherwise 

According to the similar definition for 2-sharing groups, given [S, L, U] £ ShLin, we 
say that {S, L) is linear for a term t when for all o e 5 it holds that x\/i{Oi i) < 1- 
Note that, when t is a variable, the definition boils down to check whether t E L. 

Given X C rel{S, x,t), we fix the set L of linear variables and partition X in 
three subsets X^ = {o e X \ Xm(o, t) = 0}, Xt ^ {o e X \ Xm(o, a;) = 0} and 
Xxt = X \ {Xx U Xt). Moreover, we need to define the following subsets of X: 

Xf^ ={BeXt\xii{B,t) = oo}, ={BeXt\xii{B,t)eN}, 
Xf^ ={BeXt\x'!(iiB,t) = l}, X>' ={BeXt\xiiiB,t)>l}, 

X=t^ ={Be Xxt I xUB, t) = 1}, X>i = {Xxt I xiiiB, t) > 1}. 

Since we do not deal with definite linearity, we need to take into account the sharing 
groups which can be obtained by linearizing variables. This may be accomplished 
by using the set U instead of L when computing the multiplicity. We denote by 

the set 

Xg^{BeX,t I xliiB,t) = l} , 

which corresponds to the linearizable sharing groups. 

Moreover, given sets Ai, . . . , An with n > 2 we denote by bin(Ai, . . . , An) the 
set {lj{ai, . . . ,a„} | ai G Ai, . . . , a„ e A^}, by A* the set {IJB | i? C A} and by 
the set {IJ B \ B C A, B 0}. This notation slightly deviates from most of 
other literature on Sharing, where A* does not include the empty set. We prefer 
to adopt a double notation, namely. A* and A'^ , which is more standard in the rest 
of the research community. 



Definition 5.15 (Abstract unification algorithm for ShLin) 

Given [S, L, U] G ShLin and the binding x/t such that vars{x/t) C U, we define 

mgu,;([^, L, Uix/t) = [{S \X)UK,U'U L\ U] , 

where X = rel{S,x,t) = {B e S \ Bnvars{x/t) ^ 0} and U' = U\vars{{S\X)UK). 
Here, K is the set of new sharing groups created by the unification process and U' 
is the set of variables which do not appear in any sharing group of the result, i.e. 
the set of ground variables. K is defined as follows: 
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Ux e L: 



hm{XtUm,X>\X+,X:^)U 

bin({{o} U (UZ) I o e Xf\ ZCX,,1< \Z\ < xiAo, <)}, (^.f )*)u 
{X^t^ ■ 
• lix(^L: 

K = bin(X>i U X>\X, U X*)U 
bin((Xr^)+,X,UX=i,(X=/)*)U 

Finally, the set L' of linear variables which are not ground is 



(6) 



(7) 



L' 



L \ {vars{Xx U X^t) H vars{Xt U Xxt)) if (S, L) is linear for x and t, 
L \ vars{Xx U Xxt) otherwise, if {S, L) is linear for x, 

L \ vars{Xt U Xxt) otherwise, if {S, L) is linear for t, 

L \ vars{X) otherwise. 



(8) 



Theorem 5.16 {Optimality o/mgUj;) 

The operator mgUj,; in Definition 15.151 is correct and optimal w.r.t. mgu, when 
vars{x/t) C U . 



Example 5.17 

Let 5 = {^,xv,xy,zw},L = {v,w,x,y} ,U — {v,w,x,y, z] and consider the bind- 
ing x/f{y,z). It is easy to check that {S,L) is linear for x but not for t. Ap- 
plying our operator, we obtain ingUgi{[S, L,U], x/ f{y, z)) = [S',L',U] with S' — 
{9},xy,vwxyz,vwxz} and L' = {w}. This is more precise that the operators for 
Sharing x Lin in (|Hans and Winkler 1992) . Actually, even using the optimizations 
proposed in ( |Howe and King 2003[IHill et al. 2004[) . one obtains as result the object 

[{vxy, vwxz, xy, wxyz, vwxyz}, {w}, U] . 

The optimization proposed in ( [Bagnara et al. 2005[ ) is not applicable as it is, since it 
requires vars{rel{S, x)) and vars{rel{S, f{y, z))) to be disjoint. Even assuming that 
this test for independence may be removed as unnecessary, the final result would 
be the same as above. In both cases, our operator is able to prove that vxy and 
wxyz are not possible sharing groups. 

Note that, in a domain for rational trees, the sharing group vxy is needed for 
correctness, since the unification of {x/ f{f{v, y), c), z/w} with the binding x/ f{y, z) 
succeeds with {x/ f{f{v,y),c), z/c,w/c,y/ f{v,y)}. This means that we are able to 
exploit the occur-check of the unification in finite trees. As a consequence, our 
abstract unification operator is not correct w.r.t. a concrete domain of rational 
substitutions ( |King 2000D . □ 
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An alternative would be to compute the abstract unification following Theorem 
I5.9l with xm and W replaced by xf/ and U respectively (we can obviously ignore the 
delinearization operator (_)^ since B U B = B). However, we do not pursue further 
this approach. 

In the case vars{x/t) ^ U , we may proceed as for ShLin" and ShLin^: enlarge the 
set of variables of interest in order to include all vars{x/t) and compute unification 
with mgu^;. 

Definition 5.18 {Abstract unification algorithm with extension in ShLin) 

Given [S, L, U] G ShLin and the binding x/t, let V = {vi, . . . , u„} be vars{x/t) \ U. 

We define: 

mgu^iilS, L,U],x/t) ^ mguJ[S U {vu. .. ,Vr^},LUV,U UV],x/t) . 
Theorem 5.19 [Optimality o/mgu^; with extension) 

The operator mgu^; in Definition 15 . 1 81 is the optimal abstraction of mgu. 

Although the abstract operator mgUj,; is optimal for the unification with a single 
binding, the optimal operator for a multi-binding substitution cannot be obtained 
by considering one binding at a time. This is a consequence of the fact that the 
corresponding operator for single binding unification on ShLin^ cannot be extended 
to an optimal multi-binding operator by simply considering one binding at a time. 
In fact, all the counterexamples in Section [5. 31 are also counterexamples for mgu^;, 
since it is the case that [S]u = ^si{oisi{[S]u))- 



6 Optimal unification in practice 

In this section, we give some evidence that there are practical advantages in using 
the optimal unification operators for ShLin. It is far beyond the scope of this pa- 
per to provide an experimental evaluation of the new algorithms, but the results 
m gnara et al. 2005|) give some hints on its possible outcome. Bagnara et al. 



(2005) introduce an improvement for Sharing x Lin x Free exploiting some ideas 
from King's unification operator for the domain ShLin^. In this way, they improve 
precision in a few cases and show that efficiency of the analysis is more likely to be 
increased than decreased. In fact, even if the final result of the analysis does not 
change, a more precise operator may reduce the number of sharing groups in the 
intermediate steps, which helps performance. Hence, we expect the optimal unifica- 
tion for ShLin to further improve the analysis, both in efficiency and precision. This 



is more evident if we consider that Bagnara et al. (2005 ) measure precision in terms 



of the number of independent pairs (as well as definitively ground, free and linear 
variables) and do not consider set-sharing. However, Bueno and Garcia de la Banda 
(2004) show that set-sharing information may be useful in several application of the 
analysis, such as parallelization of logic programs. Hence, a greater improvement in 
precision is to be expected if we consider the full set-sharing property. 

We now provide a concrete example of a simple program where our abstract 
operators give better results than the operators known in the literature. 
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6.1 An example: difference lists 

We work with difference lists, an alternative data structure to lists for representing 
a sequence of elements. A difference list is a term of the kind A \ B where A and 
B are lists, which represents the list obtained by removing B from the tail of A. 
For example, using PROLOG notation for lists, [1,2,3,4] \ [3,4] represents the list 
[1,2], while [l,2,3|a;] \ a; and [1,2,3] \ [] represent the list [1,2,3]. The difference 
lists whose tail is a variable (such as [1, 2, 3|a;] \ a:) are mostly useful, since they can 
be concatenated in constant time. An overview of difference lists may be found in 
( [Sterling and Shapiro 1994D . 

We define the predicate diffiist/i, which translates lists to difference lists and 
vice- versa. The goal ^ difflist{l, h, t) succeeds when the difference list h\t represents 
the standard list /. For example, diffiist{[],x,x) and di fflist {[1,2, 3], [1, 2, 3\x],x) 
succeed without any further instantiation of variables. In order to improve the pre- 
cision of the analysis, we keep head and tail of difference lists in separate predicate 
arguments. The code for diffiist/S, in head normal form, is the following. 

diffiist{l,h,t) ^l = W,h = t. 

diffiist{l,h,t) ^ I ^ [x\l'],h= [x\h'], difflist{l' ,h' ,t). 

where 1,1' (list), h,h' (head), t (tail) and x are variables. We informally compute 
the goal-independent analysis of diffiist on the domain ShLin, which gives: 

idiffiistf =m,{i,h,t},{i,h,t}] , 

Idiffiistf =[{0, ht, hi], {I, h, t], {I, h, t]] , 
Idifflistf ^{diffiistf . 

The result of the analysis is not affected by our improved unification operator: 
the standard mgu for ShLin, as given in (IHans and Winkler 1992 p . yields exactly 
the same result. Now, suppose we want to analyze the goal ^ diffiist{l, h, h). This 
corresponds to the goal <— diffiist{l, h,t), h ^ t in head normal form. Its semantics 
may be computed, using our operators, as 

mgu,i([{0, ht, hi}, {I, h, t}, {I, h, t}], h/t) = [{0, ht], {/}, {/, h, t]] . 

By projecting over I and h, we get [{0, h}, {l}, {l, h}]. Hence, the analysis is able to 
infer that, upon exiting the goal <— diffiist{l, h, h), the variable I is ground. 
By using the standard mgu for ShLin in (IHans and Winkler 1992p . we get 

[{$,ht,htl],{l],{l,h,t]] , (9) 

hence I is detected to be linear but not ground. The optimizations introduced 
in dHowe and King 2003] IHill et al. 2004| [Bagnara et al. 2005D do not improve 
this result. This is a consequence of the fact that these optimizations have been 
developed to be correct also for rational trees. In this case, you cannot infer that 
I is ground after ^ difflist{l,h,h) since the substitution in rational solved form 
{^/[t'], /i/[w|/i]} is a correct answer for the same goal. 

If we perform the analysis in ShLin^, using our operators we have '^diffiist\ = 
[{%,hl,ht}]iht and the result for the goal <— diffiist {I, h,h) is [{0, /i}]z/i. However, 
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by using the original operator in ( [King 1994P , the semantics of difflist does not 
change, but the resuh for the goal <— difflist {l,h,h) is [{%,h°° ,h°°l°°}]ih thus I is 
not proven to be either ground or linear. 

The fact that optimal operators improve groundness information is somehow 
surprising. Generally, one expects that groundness affects aliasing analysis, but not 
vice- yersa. In fact, it is well known that Sharing is a refinement ( Cortesi et al. 
1997^ of the domain Def . However, as far as groundness is concerned, the precision 
of Sharing and Def is the same, i.e., the other objects included in Sharing do 
not improve groundness analysis (ICortesi et al. 1998|) . As far as we know, there is 
no abstract unification operator in the literature, for a domain dealing with shar- 
ing, freeness and linearity, which is more precise that Def for groundness. On the 
contrary, the example above shows that ShLin, endowed with the optimal unifica- 
tion, improves over Def. Amazingly, in this example ShLin is even better than Pos 
( [Armstrong et al. 1994[ ). In the latter, the abstract semantics of difflist is h <^ {IM), 
i.e., h is ground iff both I and t are ground. The result of the analysis for the goal 
^ difflist{l, h, h) is 3t {h ^ {I At) Ah ^ t). This is equivalent to h ^ I which does 
not imply groundness of I. Actually, h I is the groundness information which 
may be inferred by ([9|). 

6.2 Another example for ShLin^ 

As far as we know, there is no implementation or experimental evaluation of the 
domain ShLin^. We think it would be worthwhile to give such an implementation 
and that there is some evidence that ShLin^ improves over ShLin also in practice. 
For instance, we show a simple program where King's domain is more precise than 
ShLin with optimal operators. 

We provide a variant of the predicate difflist /3, which we call difflist' /2, with 
only two arguments: head and tail of the difference list are encoded in the second 
argument as the term head \ tail. 

difflist' {I, d) ^l=[],d=h\h. 

difflist' {I, d) ^1= [x\l'],d= [x\h]\t,d' = h\t, difflist' il' ,d'). 

We informally compute the goal-independent analysis of difflist' on the domain 
ShLin, which gives: 

idifflist'f ^m,{d,i},{d,i}] , 

Idifflist'f =[{0,dZ,d},{/},{d,/}] , 
Idifflist'f =ldtfflist'f . 

The same analysis, computed over ShLin^, gives 

{difflist' f =mu , 

idifflist'f =[{%,dl,d,d°°}U , 
Idifflist'f ^idifflist'f . 

Now, suppose we want to analyze the goal ^ difflist' {I , d) , d = [xi,X2\h] \ t, 
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which extracts the first two elements from the difference list d. In ShLin we have 
the following 

mgu,i([{0, dl, d}, {/}, {d, /}], X2\h] \ t) = 

[{%} U bin({dZ, d}, {xi,X2,h, t}*), {/}, {d, Z, xi, X2,h, t}] . 

Note that the sharing group dlxiX2 is part of the result. If we repeat the analysis 
in ShLin^, we have 

mgU2i[{9,dl,d,d°°}]di,d/[xi,X2\h] \ t) = [{9, dlxi, dlx2, dlh, dlt} U 

i{|+Jx I X e p{{d-xT,d-xT,d-h-,d-'tnm,,^^^^,, . 

This result does not contain the sharing group dlxiX2- 

Generally speaking, it is easier to analyze the predicate dijflist/3 than dijflist' /2. 



Codish et al. (2000 1 propose a method named untupling which is able to automat- 



ically recover dijflist/3 from dijflist' /2. 



7 Related work 

In this paper, we work with a concrete domain of substitutions on finite trees. In 
the literature, some authors deal with rational trees. 

Since any correct operator for rational trees is also correct for finite trees, we can 
compare the unification operators for rational trees with ours (of course, this is not 
entirely fair as far as the precision is concerned). The opposite is not true, since an 
abstract unification operator for finite trees may be able to exploit the occur-check 
condition. We have shown in Example 15.171 that our optimal operator can exploit 
the occur-check condition, and thus it is not correct for rational trees. 



7.1 Sharing 

It is well-known that the abstract unification operator of the domain Sharing alone 
(i.e., without any freeness or linearity information) is optimal. Cortesi and File 
(1999) give a formal proof of optimality, considering a slightly different unifica- 
tion operator with two abstract objects and a concrete substitution. Since the two 
abstract objects are renamed apart, it is equivalent to consider a single abstract 
object. The basic idea underlying the proof is to exhibit, for each sharing group in 
the result of the unification, a pair of concrete substitutions generating the resulting 
sharing group. We follow the same constructive schema in the proof of optimality 
for ShLin" (but we look for a single substitution, due to the different concrete oper- 
ator). Instead, to prove optimality for ShLin and ShLin^, we use a direct approach 
and show that the abstract unification operator corresponds to the best correct 
abstraction (i.e., a o mgu,^ o 7) of the unification on ShLin'^ with simple (although 
tedious) algebraic manipulations. 

A different unification operator has been proposed in (jAmato and Scozzari 2002] 
lAmato and Scozzari 2009|) for goal-dependent analysis of Sharing. In this paper, 
the standard unification operator is splitted into two different operators for forward 
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and backward unification. Both operators are proved to be optimal and the overall 
analysis is strictly more precise than the analysis performed on Sharing equipped 
with the standard operator. 

As far as we know, these are the only optimality results for domains encoding 
aliasing properties. 



7.2 Sharing x Lin 



In most of the work combining sharing and linearity, freeness information is included 
in the abstract domain. In fact, freeness may improve the precision of the aliasing 
component and it is also interesting by itself, for example in the parallelization 
of logic programs ( [Hermenegildo and Rossi 1995D . In this comparison, we do not 
consider the freeness component. 

The first work which combines set-sharing with linearity is ( [Langen 1990[ ), fol- 
lowed by (Hans and Winkler 1992"]) . The initial unification algorithm has been 



improved by Howe and King (2003) and Hill et al. (2004) by removing an inde- 
pendence test. This increases the number of cases when linearity information may 



be exploited. Bagnara et al. (2005) propose a different improvement, adopting an 
idea by King (1994) for the domain ShLin^, which simplifies the unification of a 
linear term with a non-linear one. Example l5.17l shows that, even adopting all these 
improvements, we still obtain a strictly more precise operator. Since our operator 
is optimal, any further improvement is now impossible. 

Bagnara et al. (2002[ ) show that, if we are only interested in pair-sharing informa- 



tion. Sharing is redundant. They propose a new domain SS'' which is obtained by 
discharging redundant sharing groups. A sharing group i? in a set S is redundant 
if \B\ > 2 and yx,y G B. 3C G S. {x,y} C C C B. Analyses performed with SS'' 
are shown to be as precise as those performed with Sharing, if only pair-sharing 



information is required. Hill et al. (2004) introduce the domain SS'' x Lin x Free. 



Example 15.171 shows that our operator is still more precise (of course, without con- 
sidering the freeness component) , because of the sharing group vxy which does not 
appear in S' and is not redundant for SS''. In any case, Bueno and Garcia de la 
Banda (2004) have shown that classical applications of sharing analyses, such as 
parallelization of logic programs, are able to exploit information which is encoded 
in Sharing x Free but not in SS'' x Free. 

An alternative presentation of Sharing x Lin, based on set logic programs, has 



been introduced by Codish et al. (2000). However, the proposed operators are not 



optimal, as shown in IHill et al. 20041 

The domain ShLin^ is introduced by King (1994), which provides correct opera- 



tors for abstract unification. However, these operators are not optimal, as Examples 
and EH show. 



7.3 ASub 

An alternative approach to aliasing analysis is to only record sharing between pairs 
of variables (and possibly linearity and groundness information). The best known 
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domain of this category is ASub, introduced by S0ndergaard (1986) and formalized 



by Codish et al. (1991 ). The domain ASub is the reduced product of pair-sharing, Lin 



and Con ( Jones and S0ndergaard 1987[ ), which is the simplest domain for definite 



groundness. Recently, King (2000) reformulated the proofs in order to work with 
rational trees. Moreover, King's algorithms are parametric w.r.t. the groundness 
domain, allowing to replace Con with more precise domains such as Def and Pes. 

The domain Sharing x Lin is strictly more precise than ASub, since it embeds 
more groundness information (equivalent to Def) and set-sharing information. Since 
our operator for Sharing x Lin is optimal, we are sure that the analyses performed 
in Sharing x Lin are strictly more precise than those in ASub. 

The following is a counterexample to the optimality of the abstract unification 
in ( [King 2000] ), in the case of finite trees, when pair sharing is equipped with Def 
or Pos. 

Example 7.1 

Consider the object n — [x ^ y, {xy}) where the first component is a formula of Def 
and Pos and {xy} is the set of pairs of variables which may possibly share. In this 
domain, linearity information is embedded in the second component in the following 
way: if v is not linear, then vv must be included in the second component. Thus, both 
x, y and z are linear in (x <-> y, {xy}). We want to unify k, with x/ f(y, z). By using 
the algorithm ( [King 2000| , we obtain {y <-> x/\x z, {xy, xz, yz, xx, yy}). However, 
in Sharing x Lin we may represent k with [S, L, U] = [{xy, z}, {x, y, z}, {x, y, z}] 
and mgU3;([S', L, U],x/ f{y, z)) = [{xy}, {z}, {x, y, z}] which proves that z is ground. 
□ 



Actually, King (2000) does not state explicitly how to compute the groundness 
component of the result, although he says that it must be computed before the 
linearity and pair-sharing components, in order to improve precision. However, it 
seems safe to assume that the author's intention was to compute the groundness 
component using the abstract operators already known, and therefore indepen- 
dently from the pair sharing component. This is what makes our operator more 
precise, since linearity information may help in tracking ground variables when 
working over finite trees. 



7.3.1 Alternating paths 

The domain ASub and its derivatives ( [King 2000[ ) use the concept of alternating 
path. Alternating paths may seem the counterparts, for pair-sharing, of sharing 
graphs. We now investigate this idea, and show to what extent this correspondence 
is faithful. 

We call carrier graph a special graph defined by a set of equations E. Each 
distinct occurrence of a variable in _E is a node. Edges in the carrier graphs can be 
of two types: 

• edges of type one between two variable occurrences if the occurrences are on 
opposite sides of a single equation in E, 
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• edges of type two linking two (distinct) occurrences of the same variable. 

An alternating path is a sequence of edges of alternating type over the carrier graph. 

Alternating paths in ASub (and derivatives) are used to prove correctness of 
the abstract unification operators. For example, they are used to prove Prop. 3.1 
in ( |King 2000p . Sharing graphs are used in this paper to prove Theorem 14.311 
which is the starting point to prove correctness and optimality of the unification 
algorithms for ShLin^ and ShLin. However, sharing graphs are also used to compute 
the abstract unification in ShLin'^. Even if alternating paths are not used, in the 
literature, for computing abstract unification, they could. For any object of pair- 
sharing o, which is a set of pairs of variables, consider any substitution 9 in the 
concretization of o. Then, the object o is an abstraction of the set of alternating 
paths in 0. More precisely, it represents all the paths which start and end with edges 
of type one, which we call admissible paths. They are abstracted by considering only 
the start and end variables. In order to unify o with the binding x/i, we build a 
carrier graph with all the occurrences of variables in o and x/t. For each pair of 
variables in o, we add an edge of type one. We add edges of type one and two for 
the binding x/t, as explained above. Finally, we add all the type two edges between 
any occurrence in x/t and any occurrence of the same variable in o. We consider 
all the admissible paths over the graph so obtained. It is not difficult to check that 
the result of the unification algorithm for pair-sharing in ( [King 2000[ ), without any 
additional groundness domain, is the set of all the start and end variables for all 
these admissible paths. 

Example 7.2 

Let S — {xv} be the set of pairs of variables which share, and consider the binding 
x/r(y, y). We obtain the carrier graph: 



y 

which gives origin to several alternating paths. Among them, there is an admissible 
path from w to w, which proves that v is not linear after the unification. □ 

The first difference between alternating paths and sharing graphs is that all the 
alternating paths are subgraphs of the same carrier graph, while each sharing graph 
has a different structure, with a different set of nodes. The second difference is that 
the information coming from the abstract object and the binding is encoded in a 
different way. For instance, consider the set S = {xy} and the binding x/z. We 

1 1 

obtain a carrier graph with 4 nodes x, y, x, z, two edges x y and x z 

of type one, and an edge x x of type two. Therefore, the sharing information 

coming from the initial pair-sharing and the binding is treated symmetrically, and 



y 
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is entirely encoded on the edges. Performing unification on the carrier graph boils 
down to devising the alternating paths on the graph. On the contrary, each sharing 
graph has a set of nodes labelled by xy, x and z, with suitable multiplicities. The 
labels of the nodes encode the initial pair-sharing information, while the binding 
affects the multiplicity of nodes. The process of unification consists of adding the 
necessary arrows to get a sharing graph. 

If we consider a single alternating path in a carrier graph and the sharing graph 
for the same pair-sharing information and the same binding, they are obviously 
related, although not in a straightforward manner. Consider an admissible path 
and delete all type two edges, collapsing in a single node their start and end nodes 
(type two edges are used in the carrier graph to avoid the creation of invalid paths, 
but in a single alternating path they do not add information). Then, each type one 
edge coming from the initial pair-sharing information corresponds to a node in the 
sharing graph, while a type one edge coming from the binding becomes an arrow 
in the sharing graph. 

Example 7.3 

Consider Example 17.21 We depict the (admissible) alternating path from u to u, its 
collapsed version and the corresponding sharing graph. 



■y y- 



XV 



y 







XV 



Note that, while in the carrier graph, non-linearity of the variable x is handled by 
duplicating the variable y which occurs twice, in alternating paths without type 
two nodes, the duplicated variables are x and v, which are connected to y. The 
same holds in the sharing graph, where we have only one node labeled by y and 
two nodes labelled by xv. □ 

In sharing graphs we also require the multiplicities of a node to be equal to 
its in- and out-degrees. This makes possible to handle groundness at the same 
level of sharing and linearity, without requiring a separate domain. Remember that 
a sharing group S with multiplicity n corresponds, in the concrete domain, to 
a variable u such that 0~^{u) — S. If the degree of the node labeled with this 
sharing group is not n, this means that one of the occurrences of u is bound to a 
ground term. This would make ground the entire connected component containing 
S. Hence, in order to correctly and precisely propagate groundness, we just forbid 
this kind of sharing graphs. On the contrary, the pair-sharing algorithm in ^ King 
2000), which focus on a single path in the carrier graph, is not able to extract 
groundness information without the help of an auxiliary domain. 
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7.4 Lagoon and Stuckey's domain 

[Lagoon and Stuckey (2002) have recently proposed a different approach to pair- 
sharing analysis. The authors use multigraphs, called relation graphs, to represent 
sharing and linearity information. The nodes of the multigraph are variables, and 
two of them may share only if there is a traversable path from one variable to the 
other. Intuitively, each binding generates edges of different types. The definition of 
traversable paths is very similar to that of alternating paths. A traversable path is 
a sequence of edges, such that contiguous edges are always of different types. 

This domain should be coupled with a groundness domain, and operators are 
parametric w.r.t. the latter one. The authors show that relation graphs, when cou- 
pled with the Def groundness domain, are more precise than Sharing and ASub. 
However, this is not the case for Sharing x Lin, at least in the case of finite trees, 
since the operators in ( [Lagoon and Stuckey 200"2| ) are not able to use linearity to 
improve the precision of the groundness component. 

Example 7.4 

As shown in Example 17. li if we unify [S, L, U] — [{xy, z}, {x, y, z}, {x, y, z}] with 
the binding x/f{y,z), we obtain mgn^i{[S, L,U],x/ f{y, z)) = [{xy}, {z}, {x,y, z}], 
proving that z is ground after the unification. In the domains iloef and fipos of 



(Lagoon and Stuckey 2002), the abstract object corresponding to [S, L, U] is 



V ,x^y) 



IJ.1= [X- 

Intuitively, the first element of /ii encodes the sharing information, namely, that a; 
and y may share (while z does not share neither with x nor with y). The second 
element of /ii is an element of Pos (and also of Def) and denotes the groundness 
information that x is ground if and only if y is ground. 

The unification of ^1 with x/ f{y, z) in Opos is realized by abstracting the substi- 
tution and composing the two abstract object. The abstraction of x/ f{y, z) is 

\ 



M2 




,x^ {y Az) 



The first element says that x shares with both y and z, while y and z do not share. 
The second element says that x is ground if and only if both y and z are ground. 
The abstract conjunction is 

\ 




(a; <-> y) A (x — » z) 
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where edges drawn in different styles are compatible, namely, that they come from 
different bindings. From this result, it is not possible to infer that z is ground after 
the unification. □ 



In the actual implementation. Lagoon and Stuckey (2002 ) use another representa- 



tion for their domain. Each pair of variables is annotated with a formula denoting 
the groundness models under which the corresponding pair-sharing may occur. For 
example, a pair uv annotated with the formula u A v A w A z means that u and 
V may share only if none of u,v,w,z is ground. We conjecture that this domain 
may be embedded in King's ShLin^. The next example shows how to perform this 
embedding. 

Example 7.5 



We consider the example in Figure 4 in ( Lagoon and Stuckey 2002 1 . The variables 
of interest are u, v, w, z. 



uw 
vz 
uv 
wz 



u Aw 
V A z 

u A V A w A z 
w A z 



uz 
uu 
vv 
vw 



u A z 

u Aw A z 

V Aw A z 

V Aw 



For instance, uv:uAvAwAz means that u and v may share only if u, w, w, z are 
not ground, while uu : u Aw Az means that u is (possibly) not linear only if u, w, z 
are not ground. Each of these formulas may be viewed as a condition over 2-sharing 
groups. For example uv : u A v A w A z means that every 2-sharing group which 
contains u and v should also contain w and z, while uu : u Aw Az means that each 
2-sharing group where u is non-linear should also contain w and z. In order to find 
the object of ShLin^ which corresponds to this example, it is enough to collect all 
the 2-sharing groups which satisfy all the conditions enforced by the formulas. In 
this case, we get l{u°°v°°wz, u°°wz, v°°wz, uw, vz, wz, uz, vw, u, v, w, z}. □ 



7.4-1 Traversable paths 

The idea behind traversable paths is very similar to the concept of alternating path 
and relation graphs are quite similar to carrier graphs. From a carrier graph, we 
can obtain a relation graph by removing type two edges and introducing a different 
type of edge for each binding. This works because the use of non-linear terms is 
forbidden: a binding like x/r{y, y) has to be replaced by two bindings x/r{y, z) and 
y/z. However, the main difference w.r.t. traditional pair-sharing (and also ShLin") 



is that Lagoon and Stuckey (2002 ) do not abstract traversable paths to set of pairs 
of variables, but they keep in the abstract object the set of all the edges generated 
during the unification process. In this way, they are able to record that, in order 
for two variables x and y to share, the only possible path touches another variable 
z. Hence, if z is ground, x and y cannot share: in this way they recover pair sharing 
dependence information which would be lost otherwise. 

We could follow the same approach and use multilayer sharing graphs (namely, 
sets of sharing graphs over the same set of nodes, where each layer represents the 
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unification with a single binding) as abstract objects, without collapsing them to 
sharing groups. We do not think this would improve precision of the domain very 
much, since a sharing group is already a much more concrete abstraction of a graph 
w.r.t. the set of all the connected pairs of variables. In fact, already Sharing can 
encode the information that, grounding a certain variable two variables x and 
y become independent. Moreover, in the Example 17.51 we have shown that relation 
graphs may be encoded into ShLin^. 



7.5 Rational trees 

In the recent years, many authors have studied the behavior of logic programs on 
rational trees ( |King 2000[IIIill et al. 2004|) . which formalize the standard implemen- 
tations of logic languages. We have shown that our operators, which are optimal for 
finite trees, are not correct for rational trees, since they exploit the occur-check to 
reduce the sharing groups generated by the abstract unification (see Example EH?]). 
It would be interesting to adapt our framework to work with rational trees, in order 
to obtain optimal operators also in this case. Since a rational tree may contain infi- 
nite occurrences of a variable, the notion of w-sharing group needs to be extended in 
order to allow infinite exponents. Also, we need to consider infinite sharing graphs 
(or, at least, a representation of them) and find suitable regularity conditions for 
them, analogously to the regularity conditions on rational trees. 

Example 7.6 

Consider the set of w-sharing groups S — {xy,z} and the binding xlr[z^y). On 
rational trees, unifying 5 = {x/y} (such that [■S'Jxyz » [(^iKyz) with x/r{z,y) would 
get the substitution {x /r{z^x),y /r[z^y)} in rational solved form. This, intuitively, 
corresponds to the sharing group x'^y^ z where the exponent oj denotes an infinite 
number of occurrences. A possible (infinite) sharing graph generating this sharing 
group is the following: 



xy 



xy 



xy 



□ 



Although the structure of abstract objects and operators for adapting ShLin'^ to 
work with rational trees is more complex, we expect the optimal abstract operators 
for rational trees on ShLin^ and Sharing x Lin to be simpler than those presented 
here for finite trees. This is because we do not need to worry about the occur-check 
condition (embedded in our unification operator) and infinite multiplicities. 



8 Conclusion and Future Works 

We summarize the main results of this paper: 

• We define a new domain ShLin'^ as a general framework for investigating 
sharing and linearity properties and provide the optimal unification operator. 
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• We show that ShLin'^ is a useful starting point for studying further abstrac- 
tions. We obtain the optimal operators for single binding abstract unification 
in Sharing X Lin and ShLin^, and we show that these are strictly more precise 
than all the other operators in the literature for the same domains. 

• We show, for the first time, an optimality result for a domain which combines 
aliasing and linearity information. 

Moreover, as a negative result, we prove that the standard schema of the iterative 
unification algorithm (one binding at a time) does not lead to optimal operators 
for the domains ShLin^ and Sharing x Lin. As a side result, we show that ShLin 
and ShLin^ with optimal operators may be more precise than Pos for groundness 
analysis. 

Several things remain to be explored: first of all, we need to study the impact on 
the precision and performance obtained by adopting the new optimal operators and 
domains. We plan to implement the operators on ShLin^ and Sharing x Lin within 
the CiaoPP static analyzer (|Bueno et al. 19971) . Moreover, we plan to analyze the 
domain SS'' x Lin ( [Bagnara et al. 2002] ) in our framework and, possibly, to devise 
a variant of ShLin^ which enjoys a similar closure property for redundant sharing 
groups. This could be of great impact on the efficiency of the analysis. Last but not 
least, we plan to translate our framework to the case of unification over rational 
trees. 

A Proofs of Section [4] 

In this section we give the proofs of the main results of the paper. 
Theorem 

The relation o is well defined. 
Proof 

It is enough to prove that {6i~^{v)\u | w £ V} = {62~^{v)\u \ v £ V} when 
9i ~(7 02- Assume that 9i ~j/ 02, then by definition of ~[/ there exists a renaming 
p such that p{9i{u)) = 92{u) for each u £ U. Given S = 9i^{v)\u, if w = p{v) we 
have 62^{w)\u = 6^^{v)\{j = S. This concludes the proof. □ 

Proposition \4-13\ 

Given a substitution 9, a variable v and a term t, we have that — 
occ{v,9{t)). Moreover, given a set of variables U, when vars{t) C U, it holds that 
Xi9-Hv)\u,t) = occiv,9it)). 

Proof 

Let B = 9^^{v). The proof is by induction on the structure of the term t. If 
t = a is a constant, then occ{v,9{a)) = occ{v,a) = which is equal to x{B,a,) 
since occ{w,a) — for each w €V. If t = w is a variable, then occ(v,9{'w)) = 
9~^{v){w) = B{w). At the same time, x{B,t) = B{w) since occ{w,w) = 1 and 
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occ{y,w) = for y ^ w. For the inductive case, ii t = /(ii, . . . , t„), we have 
occ{v,t) = X]r=i occ{v,ti) — X^iLi x{B,ti) by inductive hypothesis. Moreover 

n n n 

x(B,t)= iB{v)-Y,occ{v,t,)) = J2 E B{v)- occ{v,t,)=Y,x{B,t,) . 

veW_BW 1=1 1=1 velBW i=i 

Let [/ be a set of variables with vars{t) C U. By definition, x{(^^^{''^)\u,'t) ~ 
J2w£e-'^(v)\u occ{w,t). Since vars{t) C U, for any w ^ U it holds that occ{w,t) — 0, 
and thus x(6'-i(w)|[/,<) = xid^Hv),t). □ 

Proposition \4-24\ 

Given substitutions 9, rj £ ISubst and an w-sharing group i?, we have 
Proof 

Using the definitions and simple algebraic manipulations, we have 

= >^w.x{Xv-x{B,v{v)),d{w)) 

= Xw. ^ xiB, v{y)) ■ occ{y, e{w)) 
y 

= Au;. ^ I ^ B{x) ■ occ{x, riiy)) \ ■ occ{y, e{w)) 
y \ X / 

= Xw. B{x) ■ occ{x, rj{y)) ■ occ{y, 0{w)) 

X y 

= Xw.^Bix) • ^?r\2;)(y) • occ{y,e{w)) 
X y 

= Xw.Y,B{x)-x{v'\x),e{w)) . 

X 

By Prop. [4?T3l we have that x(^~^ (^), ^('^)) = occ{x,r}{6{w)) and therefore 

e-\i^-\B)) = {ijo9)-\B) . □ 

Theorem \4.25\ ( Correctness of mgu^^ ) 

The operation mgu^^ is correct w.r.t. mgu, i.e., 

W[S]u eShLln"^, S e ISubst. [S]uc>[e]u =^ mgu^([5][/, (5) o mgu([6l]c/, (5) • 
Proof 

Given [S'][/o[0][/ and (5 £ ISubst, we need to prove that mgu^([S'][/, (5)omgu([0];7, (5) 
or the equivalent property auj{mgu{[0]u , S)) <uj mgU(j([S']c/, (5). 

Since mgu^^ is defined inductively on the number of bindings in 5, it is enough to 
prove that mg\i^{[S]u , x / 1) o mgu{[9]u, {x/t}) for a single binding x/t. Since com- 
position of correct operators is still correct, it follows that multi-binding unification 
is correct. 
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Moreover, when vars(x/t) % U, we exploit the identity iagn{[6]u,{x/t}) = 
mgn{mgn{[e]u,[e]yars(x/t)),{x/t})- When computing ingu{[e]u,[e]yarsix/t)) all the 
variables in vars{x/t) \ U occurring in 9 are renamed apart from x/t itself. There- 
fore each V € vars{x/t) \ U is free (hence linear) in mgu([^](7, [^\vars{x/t))j i-e-, 

a^{mg\x{[e]u, [e]vars{x/t))) = [S V^ {iv'} I V G vars{x/t) \ ?7}]c/u™rs(.;/t) • 

Therefore, it is enough to prove that mgu^^{[S]u,x/t) o mgu{[9]u,{x/t}) when 
vars{x/t) C U. Let B be a sharing group in ai^{mgu{[0]u,{x/t})), we prove that 
B G mgviJ[S]u,x/t). 

If S = J, we consider a multigraph G with only one node labelled by -g^]!- and 
no edges. It is easy to check that G is a sharing graph for S (since -S^J £ S*) and 
x/t, and that res{G) = Therefore, in the following we consider only the case 

The proof is composed of three parts: first, we look for a (special) substitution (} 
obtained by renaming some variables in 9 and such that /3 is still approximated by 
S] second, we define a multigraph G exploiting the variables of /?; third, we show 
that we can restrict G to a smaller sharing graph whose resultant w-sharing group 
is exactly B. 

First part. Without loss of generality, we assume that dom(0) = U (this is 
always possible since, in any class [6]u, there exists a substitution whose domain 
is exactly U). Let 9' = u\g\i{9 , {x / 1\) = -q o 9 with rj = mgu({0(x) — 9{t)}) and 
we have [9']u = mg\\{[9]ij, [x/t]u). Since dom{9) — U, we have vars{rj) Ci U ~ 
Consider 77' obtained from 77 by replacing each occurrence of a variable in rng(ry) 
with a different fresh variable. This means that there exists p G Subst mapping 
variables to variables such that p{r}'{x)) = ri{x) for each x G dom(77). Namely, we 
have 

p = {vi/v2 I 3x G dom(?7), ^ G S s.t. ?7'(a;)(^) = vi A r]{x){^) = V2} ■ 

Note that p is not a renaming, since it is not bijective. We now show that (3 = -q' o9 
has the property that [S]u o [P]u- For any C G a{[P]u), we may distinguish three 
cases: 

• C = {^}.ln this case G G 5 by definition of ShLin'^; 

• G = P~^{w)\u for w G mg{9) \ dom(?7). In this case occ{w, (77' o 6){v)) = 
occ{w,9{v)) for each w G V, therefore (3^^{w)\u — 9^^{w)\u e S; 

• C = P~'^{w)\u for w G rng(?7'). Hence there exists v G rng(6') such that 
occ{w,ri'{v)) = 1 and occ{w,r]'{v')) = for each v' ^ {v',w}. Hence, for 
each u e U, occ{w,r}'{9{u))) = n iff occ{v,6{u)) = n and this implies G = 
9-Hv)\ueS. 

Moreover p{(}(u)) = 9'{u) for each u £ U, therefore 9' ~[/ po P- 

Second part. Consider the labelled multigraph G such that Nc = {v \ v € 
vars{P{U))}, laiv) = p-\v)\u G S and Eg = \ f3{x){0 G V}. Note that if 
G ^7 then /3(t)(^) G V, too. Each position ^ in Eg is an arrow such that 
srcG(^) = P{x){^) and tgt(3(^) = P{t){^). Observe that the second condition in the 
definition of sharing graph for S and x/t is satisfied, since [S]u o [/?][/. 
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Let us check the third condition. For each node v G Nq, if x{l^~^ {'*^)\u , x) = n by 
Prop. [47T3l we have occ(w,/?(x)) ~ n, i.e., there are n positions in P{x) corresponding 
to V. Therefore the outdegree of v is n. In the same way, we have that x(P^^i''^)\ui t) 
is the in-degree of v. 

Third part. Given B = e'-^{u)\u, by Prop.g^llwe have B = /3-^{p-^{u))\u. 
Since 9' diu P diu 0, then [9% = mgu([/3][/, = [mgu(/3, {x/i})](7- Therefore 

po(j' r^uO' ^ nigu(6l, {x/t}) r^u mgu(/3, {x/t}) = nigu(/3(a;) = /3(t)) o (3. We cah (5 
the result of mgu(/3(x) = P{t)), and note that = /3(t) is equivalent to the set of 
equations X — {vi = V2 \ there is a position ^ such that (3{x){£,) = vi A f3{t){£_) — 
V2}- The relation p o f3 S o f3 means that, if wi,W2 G l3{U) and p{wi) = p(w2) 
then (5(u'i) = <5(u'2). The latter implies that there are in X equations of the kind 
Xi = X2, X2 — x^, . . . , Xn-i ~ Xn with xi — Wi and a;„ ~ W2, i.e., that wi and W2 
are connected in the graph G. 

Therefore, let Y — {w \ p{w) = u] = [[/9^^(u)JJ. This is not empty, since B ^ 
If ^ is an edge such that stcg{C) G then tgtQ(^) e F, since = I3{t){£,) £ 

X. The converse also holds. Hence, if we restrict the graph G to the set of nodes Y , 
we obtain a sharing graph whose resultant cj-sharing group is l+l^^gy = 
f3-^{p-\u))\u^B. □ 

Theorem \4-28\ ( Optimality of mgu^^ ) 

The single binding unification TcigVL^(\S]u , x / 1) is optimal w.r.t. mgu, under the 
assumption that vars{x/t) C [/, i.e., : 

VB e nign^{[S]u,x/t) 35 E ISubst. [S]u o [5]u and B e a^(mgu(Ht/, {x/i})) ■ 

Let X e mgUjj(5, x/i). By definition of nigu^^, there exists a sharing graph Q such 
that X e res(C/). Let Ng — {ni, . . . , n^}. We want to define a substitution 5 such 
that [SJc/ o [5]u and X e ai^(mgu([(5];7, {x/i})). If X = this is trivial, hence 
we assume that X ^ The structure of the proof is as follows: first, we define a 
substitution 5 which unifies with x/t] second, we show that 5 is approximated by 
[S]u, namely, [S]ij o [S\u] third, we show that X G ai^{-nig-a{[S\u , {x/t})). 

First part. We now define a substitution 5 which unifies with x/t. For each node 
n e Ng we consider a fresh variable Wn and we denote by W the set of all these 
new variables. 

For any y G U \ {x} we define a term ty of arity X^neA's follows: 

ty — r(^Wji^ , . . . , , , . . . , t ■ ■ • : : ■ • ■ : ) 

" V ' " V ' V ' 

lg{n-i){y) times lg{n2){y) times times 

We know that there exists a map / : Eq V such that, for each variable y 
and node n, the set of edges targeted at n and labelled with y by / is exactly 
lg{n){y) ■ occ{y,t). Namely, we require 

{e e Eg I /(e) = y Atgtg(e) = n}\ = lg{n){y) ■ occ{y,t) . 

The idea is that each edge targeted at the node n is actually targeted at one of 
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the specific variables in Ig (n) . In particular, each variable y G (n) JJ should have 
exactly lg{n){y) ■ occ{y,t) edges targeted at it, so that the total number of edges 
pointing n is '^y^ulg{n){y) ■ occ{y,t) = x(^s('^))^)) i-e-, the in-degree of n. The 
map / chooses, for each edge targeted at n, a variable in lg{n) according to the 
previous idea. 

Now, for each node n and variable y G U, we denote by Mn^y the set of edges 
pointing at y in n, i.e., Mn,y = {e € Eg \ tgtQ{e) = nA /(e) = y}. Thus M„_y may 
be partitioned in occ{y,t) sets of lg{n){y) elements, denoted by Mn,y,^ such that 

U{Mn,y,i I t{0 =y} = Mn,y. 

We may define some variations of the terms ty by replacing the variables occurring 

in them with those in the set Mn,y,i - In particular, for y G U\{x} and any occurrence 
^ of a variable y in t, we define the term t| of arity ^Q{^){y) ^ 

= r{w{Mn,^y^^),w{Mn^^y^^), . . . ,w{Mn^^y,^)) , 

where, if M = {ei, . . . , e,}, we define 'w{M) as the sequence w„'^ , . . . , Wn'^ where 

Note that ty and have, in corresponding positions, variables related to nodes 
which are connected through edges. We are now ready to define the substitution 5 

in the following way: 

• 5{x) is the same as t with the difference that each occurrence ^ of a variable 
y Gt is replaced by the term t^; 

• iov y £U\ {x} then 6{y) = ty-, 

• in all the other cases, i.e v ^U, S{v) = v. 

Second part. Now we show that [5]t/o[(5]i7. We need to consider all the variables 
V gV and check that S~^{v)\u G S. We distinguish several cases. 

• If we choose the variable u>„ for some n G A'^, by construction occ{wn,ty) = 
lg{n){y). Moreover, since ^ is a sharing graph, there are lg{n){x) edges in E 
departing from n and targeted to nodes m such that xihi^)^^) 0- Thus 
T,yevars{t)-meNg ^ M^JsTCEaie) = n}\ = lgin){x) and occ(5(.t), w„) = 
lg{n){x). Since for each v G U we have that occ((5(u), w„) = lg(n)(v), we 
obtain the required result which is 6~^{w„)\u = lg(n) G S. 

• If we choose a variable v gU then v G dom{S) and S~^{v) = -^J G S. 

• Finally, ifv^UUW, then S-^{v) = iv} and 6-'^{v)\u = i}GS. 

Third part. We now show that X G aaj(mgu([5]c/, {x/t})). By definition of mgu 
over ISubstr^, we have that mgu([(5]i7, {x/t}) = [mgu{S, {x/t})]u. We obtain: 

T] =mgu(5, {x/t}) = 

{x/t} o mgu({y = t,,\yGU\ {x}} U {y = i| | t{0 = v}) = (Al) 
{x/t} o {y/ty \yGU\ {x}} o mgu{ty = t| | t{0 = y} ■ 

Let F be the set of equations {ty = t^ | t{j) = y}. We show that, for any edge 
n m G Eg, it follows from F that Wn = Wm- Since n ^ m G Eg, then for some 
y G vars{t) it holds that f{n m) = y. This implies that n ^ m G Mm,y and 
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therefore there exists a position ^ such that n ^ m G Mm^y,^- By definition of 
it means that w„ £ vars{t^), in the same position where Wm occurs in ty, hence 
Wn — Wrn foUows from ty = t^ E F . 

We know that Q is connected, hence for any n,m € Ng , the set of equations in 
F imphes Wn = Wm- We choose a particular node n G Ng and, for what we said 
before, we have mgu(i^) = {wn/wn \ n e Ng \ {n}}. We show that r]~^{'Wn)\u — N. 

ri^^{Wn)\u 

- {x/t]-\{y/ty \yeU\ {x}}-\iwn„. . .,w„J))\u 
^{x/t}-\iwn„...,WnJ\±iXyeU\{x}. h{n){y))\u 

neNg 

= XyGU\{x}. h{n){y) y|a;^.ev°-fe.*)-E„e«g's(«)to)j 

neNg 

= XyeU\{x}. Y Wfx^-^^s^'^'^'"^'*^;^ . 

neNg 

Since is a sharing graph, the total out-degree J2neNg x(^e("-)i is equal to the 
total in-degree J2n£Ng xihi'n'):^)- Hence 

^XyeU\{x}. Y h{n){y)lSlx^"--o^^'^^'^^'^^J 

neNg 

= XyeU.Y h{n){y) 

neNg 

— res{Q) . 
This concludes the proof. □ 

Theorem \4-.SC\ [Optimality of mgu^ with extension) 

The single binding unification mgu^ with extension is optimal w.r.t. mgu. 
Proof 

Let S' = SU{lvl I V e vars{x/t)\U}, V = UUvars(x/t) and X £ mgu^(S", x/t). 
We want to find [S]u such that [S]u o> [S]u and X G atj(mgu([(5](7, {x/t})). 

Following the previous theorem, we find 6 such that X G ai^(mgu([(5]y , {x/i})) 
and [S']v c> [S]v- We want to prove that [S]u o [S]u and ai^(mgu([(5]y , {x/t})) <^ 
auj(jng[i{[S]ij, {x/t})), so that [d]u is the existential substitutions we are looking 
for. 

We first show that [S]u c> [S]u- Let v e V. Since [S']v c> [S]v, it follows that 
6-\v)\v e S'. 

• If 6~^{v)\v G 5", then 5^^{v)\v = S^^{v)\u, since vars{S) C U, and thus 
S-Hv)\ueS. 

• liS-'^{v)\v i S, then^-i(w)|y e {fv} \ v £ vars{x/t)\U}. Then S-\v)\u = 
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Now we distinguish two cases: either x € U oi x ^ U. 

li X E U, with the same considerations which led to (IAl|l . we have 

mgu{{x/t}, d) = mgu({x - <} U Eq{S\u) U Eq{S\y\u)) = 

= mgu{{x = t} U Eq{5\u) U {y ^ ty \ y e vars{t) \ U} 

For each y S vars{t) \ U there exist a position such that = y and 

{x/i}UEq((5|c/)U{?/ = ty} is equivalent to {x/t}UEq{S\u)U{^^ = ty}. Note that, 
since y ^ U, then ij, (which is actually 5{y)) is linear and independent from x/t 
and the other bindings in 5. Therefore 

mgu({a; = t} U Eq((5|c/) U {y = t^, | y £ vars{t) \ U} 
=mgu({a; = t] \J Eq((5|c/) U {t^^ = ty \ y e vars{t) \ U] 
=mgu({x = t} U Eq((5|c/)) W /?' 

where /?' = mgu({i| = ty \ y E vars{t) \ U}) and dom(/3') = t;ars({tj^ | y e 
vars{t) \ U}). It follows that 

a^([mgu({a; = U Eq(5|c/)) W /3» 
=a^([mgu({a; = U Eq(5|c/))]y) 
=a^{u\gn{[S\u,{x/t})) . 

li X ^U, then 

mgu({x/t}, 5) = mgu({x = i} U Eq((5|[/) U Eq{5\yaTs(t)\u) U Eq(^|{^}) 
=mgu({a; = t} U Eq((5|,7) u {y = t^, | y G ws(i) \ t/}U 

Note that x appears in S' only in the multiset Moreover, if n is a node labelled 
by §a;§, there is only one edge which departs from n and there are no edges which 
arrive in n. This means that 

• Wn does not appear in any ty for y eV \ {x}, 

• 6(x) is linear since given edges e ^ e', we have that src£;j,(e) ^ srcEcip')- 

As a result, 5{x) is linear and does not share variables with x/t or the other bindings 
in 5. The last formula may be rewritten as 

mgu({a; = t} U Eq{5\u) i^ {y ^ ty \ y & vars{t) \ C/}) W /3 

where is a substitution such that dom(/3) — vars{5{x)) C H^. It is obvious that 

a^([mgu({x = t} U Eq((5|[/) U {y = | y G vars{t) \ [/}) tt) /3]y) 
= a^([mgu({x = t} U Eq(5|c/) U {y = t J y G wars(t) \ . 

since dom(/3) n F = 0. 
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Let Ui = vars{t) \ U, then 

nigu({a; ^ t} U Eq{6\u) U Eq{6\u,)) 
=S\u o mgu(<5|[/({.T = t}U Eq{S\u,))) 
^S\u o mgui{x = d\uit)}) U Eqid\u,))) 

[since vars{S\ij^) vars{S\u) = and x vars{S\ui)] 
^S\uo{x/6\u{t)}oS\u, 

[since {a;} ^ ?;ars((5|t/j )] 

Note that (5|c/ o {a;/(5|;7(i)} is ingu((5|[/, {x/i}). We call 7 = 5|;7 o {x/S\u{t)} and we 
prove that ai^{['y]v) >w ^^([7 o S\ui]v)- 

Consider a variable z; S V. If w ^ vars{6\u^) there is nothing to prove. If 
V G rng(5|[/j) we know that v does not occur anywhere else in 6\u-^ and 7. Then 
(7 o = 7^"^('S2/, wS) = 7^"^ (2/) W "S^wJ for the unique y such that w G 

liars ((5 1 (y)). Therefore, since w ^ V^, the sharing group over V we obtain in 
7 o (5|[/j^ from f may be obtained in 7 from the variable y. If tj G dom((5|(7j) then 
(7 o <5|;7j^^('y) = I^J which occurs in every element of ShLin". □ 

Theorem\J^ 

Let S" be a set of w-sharing groups and a;/t be a binding. Then B G mgUj^(S', x/t) 
iff there exist n G N+, Bi, . . . , Bn G S which satisfy the following conditions: 

I. B = tt)i<i<„i3i, 

2- Ei<.<„ xiB,,x) = El<^<n x{B^,t) > n - 1, 

3. either n = 1 or VI < i < n. x{Bi, x) + x{Bi,t) > 0. 

Proof 

We first prove that these conditions are necessary. Assume that i? is a resultant 
sharing group for S and x/t, obtained by the sharing graph G. We show that there 
exist a finite set / and, for each i G /, a multiset Bi G 5*, which satisfy the above 
conditions. 

Take / = Ng and Bi = Icii) for each « G /, so that B = Wig/Bi. Since then 
in-degree of each node is x{Bi,x), the sum of the in-degrees of all the nodes is 
J2iei x{Bi,x) and the sum of the out-degree is J2iei x{Bi,t). Both of them must 
be equal to the number of edges in Eq. Moreover, each connected graph with |/| 
nodes has at least |/| — 1 edges. Finally, if a connected graph has more than one 
node, then every node i has an adjacent edge. Therefore, either x{Bi, x) or x{Bi, t) 
is not zero. 

Now we prove that the conditions are sufficient. Let /={!,..., n}. If n = 1 and 
X{Bi,x) + x{Bi,t) = for the only i e I, simply consider a sharing graph with 
a single node labelled with Bi and no edges. Otherwise, we partition the set / in 
three parts: 

• N., = {iel \ x{Bux) = 0}; 

' Nt = {iel\ x(B^,t) = 0}; 

. TV = {i G / I xiB^,x) ^ 0,x(B„i) ^ 0}; 
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Note that this is a partition oil since, by hypothesis, Mi G /. xi^i, x)+xiBi,t) > 0. 
Now we define a connected labelled multigraph G whose sets of nodes is / and whose 
labelling function is Xi G I.Bi. In order to define the edges, we distinguish two cases. 

N Let N — {61, . . . , 6^} with m > 1 and consider the set of edges: 

{a^hi\a^Nt}\J {61 -^c\ceN.^}\J {h h+i | i G {1, . . . , m - 1}} . 

N — If Nt = 0, then also = and there is nothing to prove. We assume that 
Nt ^ 0, and thus ^ 0. Let a ^ Nt, c & and consider the set of edges: 



Note that, in both cases, we obtain a multigraph with the following properties: 

1. it is connected; 

2. it has exactly n — 1 edges, i.e. it is a tree (if we do not consider the direction 
of edges); 

3. there is no edge targeted at a node i with xiht) — and no edge whose source 
is a node i with xih = 0- 

In the rest of the proof, we call pre-sharing graph a multigraph which satisfies the 
above properties. 

If indeg{i) is the in-degree of a node and outdeg{i) the outdegree, we call unbal- 
ancement factor of the graph the value: 

''^^{outdeg{i) ~x{Bi,x) \ i G I,outdeg{i) > xiBi, x)}+ 



We prove that given a pre-sharing graph with unbalancement factor fc, we can 
build another pre-sharing graph with unbalancement factor strictly less then k. As 
a result, there is a pre-sharing graph with unbalancement factor equal to zero. 

Assume that the graph has unbalancement factor k. There is at least an unbal- 
anced node. Assume without loss of generality that the unbalanced node is j and 
that outdeg{j) > x{Bj, x). Since X)ie/ x(^j; 2;) > n — 1, there exists a node I such 
that outdeg{l) < xiBi,x). Let e be the unique edge with source j such that, if we 
remove e from the graph, I becomes disconnected from j. Since no edge starts from 
a node i with xiBi,x) = 0, then xiBj,x) > 0. This means that outdeg{j) > 1 
and there is at least another edge starting from j. Assume that it is e' : j — > j'. 
Remove this edge and replace it with an edge / ^ j'- It is obvious that the result 
is a pre-sharing graph with a smaller unbalancement factor than the original one. 
The case for indeg{j) > x{Bj,t) is symmetric. 

Once the unbalancement factor is zero, since 'Ylii^i ^^Bi^x) — X^ie/ x(^ij 
can freely add other edges in such a way to complete the graph w.r.t. the condition 
on the degree of nodes. We obtain a sharing graph G such that res{G) = B. □ 

B Proofs of Section [5] 

In this section we give the proofs of correctness and optimality for the abstract 
unification operators mgU2 and mgu^j. 



{a ^ c I c G N^} [j{a^c\a&Nt\ {a}} . 




+ ^{mdeg{i) ~ x{Bi,t) \ i G I,indeg{i) > x{Bi,t)} 
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Proposition \5.1\ 

The following properties hold: 



1. a2(l+)7e) =l+)a2(7^). 

2. rel{-i2{S),x,t)) ^ -i2{rel{S,x,t)). 

Proof 

We begin by proving the first property. 
a2M-Bi,...,B„}}) 



Now we proceed with the proof of the second property. 



re/(72(5),a;,i)) 
= \J{l2{o) I o G 5, l72(o)| n vars{x = <) ^ 0} 

= \J{l2{o) I o e 5, H n «ars(a; = t) ^ 0} (since H - ll2m) 
= j2{rel{S,x,t))) . □ 

Theorem \5.S\ 

(02,72) : ShLin^ ^ ShLin'^ is a Galois insertion. 



It is obvious that a2 and 72 are monotone functions and that they are both join- 
morphisms. Extensionality of 72 o a2 follows from the fact that, given an w-sharing 
group B, we have B G 72 (0^2 (5))- Finally, given a 2-sharing group o, we have 
0^2(72(0)) — {o}. This implies that a2 o 72 is the identity. □ 

Theorem 15.51 

Given [S]u G ShLin'^ and the binding x/t with vars{x/t) C [/, we have that 



i{|+Jy I y G Pm{S'),n G x{Y,x)f^x{Y,t). n>\Y\- l]]u , 




W f oi, . . . , OnJ where Oi ^ Xv e 




WSa2(Bi),...,a2(S„)}} 



Proof 



mgn2i[S]u,x/t)^[{S\S')U 



where 5" = 



rel{S, X, t). 
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Proof 

By using Prop. EH] point [21 and since o ^ a' ^ 72(0) H 72(0') — 0, we get: 

a2{l2{S)\rel{-i2{S),x,t)) 
= a2{j2{S) \j2{rel{S, x,t))) 
= 0:2(72(6' \ rel{S,x,t))) 
= S \ rel{S, X, t) . 
Therefore, we get the equahty 

Tag\i2([S]u , X / 1) — S \rel{S,x,t)[J 

a2({W7^ I n e p^{reli-f2iS),x,t)), ^ xiB,x) = ^ x(S,<) > |7^| - 1} 

BeTZ Ben 

Now, with simple algebraic manipulations, we obtain: 

a2({HJ7^ I 7^ G p™(re/(72(S'),x,t)), ^ x(B,x) = ^ x(B,i) > - 1}) 

= a2({ttJ7^ I 7^ G p™(72(re/(S', x, t))), ^ x(S, ^) = I] x{B, t) > |7^| - 1}) 

B£Tz Ben 

= a2{{W{Bi,...,Bk}\keN, 

\/i.B,G-f2{reliS,x,t)), x{B,,^)= Yl x{B,,t) > k - I}) 

l<i<k l<i<k 

= a2({agBi,...,Bfc8 I k G N,|oi,...,Ofc8 G pm{rel{S,x,t)), 
Vi.B, G72(oO, x{Br,x)= x{B^,t) > k - 1}) 

l<i<k l<i<k 

= a2{{W^Bi, . . . , Bfc J I fc G N, (re;(S',a;,t)), 

Vi.a2(B0 = OM E x(S„x)= Y x(B.,t) > fc-1}) 
l<i:<fc l<i<fc 

(such Oi's do always exist since rel{S,x,t) is downworld closed) 

= i{a2(tU«Bi,...,BfeJ) I A: GN,|oi,...,Ofc8 G p™(re;(S, a^, t)), 

Vz.a2(B0 = o«, I] x(S„a:)= ^ x(B.,t)>fc-l} 
l<i:<fc l<i<fc 

= l{Wia2{Bi),...,a2{Bk)} 1 fc G N, -g; irel{S,x,t)), 
•ii.a2(B,) = o,, Y x{Br,x)= Y X[Br,t)>k-l} 

l<i<k l<i<k 

(by Prop. 15. II point [T]) 
= i{tU|oi,...,Ofc| I G N,goi,...,OfeJ G p^{rel{S,x,t)), 

•ii.a2(B,) = o,, Y x{Br,x)= Y X[Br,t)>k-l}} 

l<i<k l<i<k 

= l{Wioi,...,Ok} I G N,goi,...,OfcJ G pmirel{S,x,t)), 

\/i.a2(B^) = o^,\/i.a2{B',) = o„ Y x{Br,x) = Y x{B[,t)>k-l} 

l<i<k l<i<k 

(we discuss later why this is faithful) 
= i{W|oi,...,OfcJ I fc G N,|oi,...,OfcJ G pm{reliS,x,t)),n>k-l, 
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« e { ^ Bi{x) I Vi.a2(Bi) = Oi} n { ^ xiB'i,t) \ Vi.a2(B-) = Oi}} 

l<i<k l<i<k 

= i{l±lfoi,...,Ofcl I k € N,foi,...,Ofc| € pm{rel{S,x,t)),n>k-l 

l<i<fe l<i<k l<i<k 

The move from a single family {-Bi}i<i<fc to different families {-Bi}i<i<fc and 
{Bi}i<i<k is possible since, if 

Vi.a2(Bi) =0i andVi.a2(S^) =o, and ^ x(Si,a;)= ^ xiK^) > ^ - I , 

l<i<k l<i<k 

we may define a family {Ci}i<i<fe such that Ci{x) = Bi{x) and Ci{v) = B'{v) for 
each V ^ x.lt is immediate to check that the Cj's satisfy the condition 

Mi.a^{Ci) = Oi and ^ xiC^x) = ^ x{Ci,t) >k-l . 

l<i<k l<i<k 

If we denote with c{^oi, . . .,Ok^,t) the set { Z]i<j<fe x(-Bi,i) | Vi.a2(-Bi) = Oj}, 
what remains to prove is that 

i{|+JX I X e p,nireliS,x,t)),ne xi^ , x) n c{X , t) .n > \X\ - 1} 

= i{|+J X\Xe p^{rel{S, X, t)), n e x(^, n x(^, t), n > |X| - 1} , 

where the only difference is that wc replaced c{X,t) with x(X, f). 

We begin by examining the relationship between c(X, i) and x(X, t). First of all, 
it is obvious that c{X,t) C xi^^t)^ therefore we only need to prove half of the 
equality. 

If there exists o & X such that xm(o, t) = oo, then c{X,t) is an infinite set. We 
call n its least element. Under the same conditions, x(-^)i) is the interval [n, oo]. 
If there is no o € X such that Xm(o, = oo, then c{X,t) = x(X, i) and they are 

both singletons. 

In the same way, if there exists some o G X such that o{x) = oo then x(^) 
is an interval of the kind [n, oo). However, if there is no such o, then xi^^ is a 
singleton, whose imiquc element is |{o G X | o{x) = 

Assume that we have X e pm{rel{S,x,t)) such that there exists n G xi^^^) ^ 
xiX,t) with n > \X\ — 1. We want to prove that we may find a multiset Y G 
Pm{i'el{S,x,t)) such that there exists m > jFI — 1 with m G xO^:^) ^ c(F, and 
l±J -'^ < l±J This is enough to complete the proof of the theorem. 

We distinguish several cases. 

• x(^; ^iid xi^i t) are both infinite. In this case, c{X, t) is infinite. Moroever, 
since xi^^^) is an interval, there are infinite natural numbers in xi^^x) n 
c{X, t). We may take Y ^ X. 

• x{^^t) is infinite and xi^i^) is a singleton {«}, then v = \{o G X \ o{x) = 
1}| < k. Since it must be u > fc — 1, there are only two choices: either v = k 
ov V = k — 1. We distinguish the two subcases. 

— w = fc — 1. In this case, there exists o G X such that Xm{o, t) = and 
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o{x) — 1, otherwise it is not possible that v > Xm{X,t). Since x(X, 
is infinite, the same holds for c{X, t), hence we may find an n G c{X, t) 
such that n > V. Consider Y = X \+) {n — v) * ^oj. We have x(F, x) — 
{v+ {n- v)} = n, c(y,t) = c{X,t) and \Y\ = \X\ + n ~ v ^ n + 1. 
Therefore n G c{Y,x) fl c{Y,t) and n > |y | — 1. 1+J F is a valid result, 
and < 

— u fc. If there is a.n o E X such that Xm{o,t) — 0, the proof proceeds 
as in the previous case. Otherwise, Xm{X,t) > k and since it should be 
V = k > Xm{X,t), we have Xm{X,t) — k. Therefore k G c{X,t) too, 
since mm c{X,t) = mmx{X,t), and we may take Y — X. 

• if x(^, t) is finite, then xi^: t) ~ c{X, t) and we take Y — X. □ 
Theorem \5. 7| 

Given [S]u in ShLin^ and the binding x/t, let V — {ui, . . . ,w„} be vars{x/t) \ U. 
Then, 

mgU2([5']t/,a;/t) = mgU2([S' U {vi, . . ■ , Vn}]uuv , x / 1) . 

Proof 

First of all, given a finite set of variables V, let us define the extension operator 
exty : ShLin'^ ShLin'^ such that extvi[S]u) = [S U {fv} \ v e V\ U}]uuv- 
Given V = vars{x/t) \ U ~ {vi, . . . , Vn}, we have that 

mgU2 {[S]u,x/t) ^ a2 (mgu^ (72 ( [5] [/ ) , x/i ) ) 

= a2(mgu^(exty(72([S']!7)),a;/t)) . 

We also know that 

mgU2([5 U {vi, . . ■ ,Vn}]uuv,x/t) = 

a2(mgu^(72([5 U {ui, . . . ,Un}]c/uy), x/t)) . 

Hence, it is enough to prove that 

extv(72([S']c/)) = 12{[S U {vi, . . ■,Vn}]uuv) ■ 
By definition of 72 , we have that 
72([S' U {vi, . . . ,Vn}]uuv) 

= [[j{'l2{o)\oeSU{vi,...,Vn}}]jj^jy 

= [[jb2{o) I o e 5} U {lv^l, . . . , ivn}}]^^y [since j2{v^) = in,}] 
= extvh2{[S]u)) , 
which completes the proof. □ 
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Theorem 15.91 

Given [S]ij G ShLin^ and the binding x/t with vars{x/t) C U , we have 
mgU2([5]c/,xA) = U res{X,x,t)]u , 

XCS' 

where S' = rel{S,x,t) and res{X,x,t) is defined as follows: 

1. if X is non-linear for x and t, then res(X, x, t) = {1+J X^}; 

2. if X is non-linear for x and linear for < 1 and \Xt\ > 1, then we have 
res{X, X, i) = {(1±J X,) W (1+) y (1+) X?)}; 

3. if X is linear for x and strongly non-linear for t, \Xx\ > 1 and \Xt \ < 1, then 
we have resiX, x, t) - {(1+J X^) W (1+J X^,) y (J+J X*)}; 

4. if X is linear for x and not strongly non-linear for t, < 1, then we have 

res(X,a;,i) = {([+JZ)y(|+Jx2jy(|+JXt) | Z e p„(X,), 

|Z| =XAf(^t,t) =X™(^t,i), 

5. otherwise res{X,x,t) — 0. 
Proof 

By Theorem 15.51 we only need to show that: 
l{\^Y\Y€pm{S'),nexiY,x)nxiY,t).n>\Y\-l}=l (J resiX,x,t), 

XCS' 

(Bl) 

where S' = rel{S,x,t). We prove the two different inclusions separately. 

Left to Right Inclusion. Let o G res{X, x,t) for some X C rel{S, x,t). We 
want to prove that there exist Y e pm(S") and n G x(^;2;) fl x(l^, t) such that 
n > |y| — 1 and \±)Y = o. We distinguish several cases: 

• if X is non-linear for x and t, it is \+)X'^ = o. We distinguish two subcases: 

— if xm{X, t) = cxd, it is enough to take Y — X \+i X . 

— if XAiiX, t) is finite, since X is non-linear for t, there exists a' G X such 
that Xm{o', t) > 1. Since S" is downward closed, consider o £ S such that 
o{x) — min(o'(x), 1) and o{v) — o'{v) ifv^x. We show that there exists 
a natural number n such that, for Y = X^X^n^o^, we have XmiY, t) > 
Xm{Y,x) and Xm{Y,t) > \Y\ - 1. Since Xm{Y,x) < 2xm{X,x) + n, we 
need to solve the inequalities 2xm{X,t) 4- nXm{o,t) > 2xm{X,x) + n 
and 2xm{X,t) + nXm{o,t) > 2\X\ +n. Since Xm{o,t) > 2, there always 
exists a solution for n. Since xm{X,x) = oo, we have that J+Jy = o is 
in the left hand side of (jBip . 

• if X is non-linear for x and linear for t. We need to find m such that, if we 
take Y = X^ W 2Xxt W 2mXt, we have Xm{Y, t) > Xm{Y, x). In other words, 
we need to solve the disequation 2xmiXxt,t) + 2mXm{Xt,t) > Xm{Xx,x) + 
2xm{Xxt,x), which is always possible, since \Xt\ > 1. Since |y| < 1 + 2|X2;4|-|- 
2m\Xt\ we have XmiX,t) > \Y\ - 1. 
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• if X is linear for x and strongly non-linear for i, we distinguish two subcases: 

— Xm{X, t) = 00. Let n = 2xm{Xxt, t) + Xm{Xt,t) and consider any num- 
ber m such that 2m\Xx\ + 2\Xxt\ > n (such an m always exists since 
\Xx\ > !)• Then, consider the multiset Y = 2mXx W 2X^1 W Xt, and 
we have that Xm{y,x) = XM{y,x) = 2m\Xx\ + 2\Xxt\ > Xrn{Y,t) by 
construction. Moreover XM{Y,t) = oo and |y| < 2m\Xx\ + 2\Xxt\ + 1. 
Then (+) F S res{X, x,t) is a valid resultant sharing group. 

— Xm{X, t) is finite. Let o G X^t be a sharing group such that Xm(o, t) > 1 
and o' be a generic sharing group in X^- We need to find two natural 
numbers n and m such that, if we take Y = 2Xx W 2Xxt W l+) m\^o^ W 
n^o'\, we obtain XmO^^x) = XmO^^'t) (from this immediately follows 
Xm{Ytx) = XM{Y,t)) and Xm{Y,x) > \Y\ — 1. This means we need to 
solve the equations: 

2\Xx\+ 2\Xxt\ +m + n = 2xm(X^t, t) + Xm(^t, t) + mXm{o, t) 

2\Xx\ + 2\Xxt\ + m + n> 2\Xx\ + 2\X,,l\ + \Xi,\ + m, + n - 1 
Since \Xi\ < 1, the second equation is always satisfied. A solution for 
the first equation always exists, since the greatest common divisor of 
Xm{o,t) — 1 and 1 is 1. 

• if X is linear for x and X is not strongly non-linear for t, consider the multiset 

Y = ZiSX^t^ X^t W X,. Then x,n{Y,x) = XMiY,x) = \Z\ + 2\X,t\ and 
Xm{Y,t) = XM{Y,t) = 2\Xxt\ +Xrn{Xt,t). Sincc \Z\ = Xrn{Xt,t), we have 
that Xm{Y,x) = Xm{Y,t). Moreover, |y| = \Z\ + 2\X^t\ + \Xt\ < Xm{Xt,t) + 
2\X^t\ + l = Xm{Y,t) + l. 

Right to left inclusion. Let o = where X e prn{S') and there exists 
n > l-'^l — 1 such that n £ x{X, x) n x(X, t). We show that there exists Y C S' and 
o' G res{Y,x,t) such that o' >2 o. Let k = \X\. We partition X in three multisets 

Xx = X\{o\xM{o,t)=o}, Xt = X\^o\xm{o,x)=o} and X\xt = X\io\xM{o,t)>OAxM{o,t)>o}- 
Note that X^, Xt and X^t here are multisets and not ordinary set as in the definition 
of mgU2. We distinguish several cases: 

• if [[Xjj is hncar for x and strongly non-linear for t, then Xm{X, x) = Xm{X, x) 
= \Xx\ + \Xxt\ < k. Since Xm{X,x) > k — 1, there are two cases: either 
\Xx\ + \Xxt\ = fc — 1 or \Xx\ + \Xxt\ = k, which implies that \Xt\ < 1. 
Since [[XJ] is strongly non-linear for t, there exists o" G Xi ttl X^t such that 
XM{o",t) > 2, and thus Xm(X,t) > 2. Therefore Xm{X,t) > {X^tl- Since 
Xm(X, .t) = xm{X,x) > Xm(X, t), we have that \Xx\ > 1. It follows that 
o = l±J(X,WX,tWXO <2 (l±JM)'w(l±Jl^xt|)2w(l±)M) e res{lXlx,t). 

• if [[XJJ is linear for x and not strongly non-linear for t, then, as in the previous 
case we have \Xt\ < 1. Since X is not strongly non-hnear for t, XM{X,t) = 
Xm{X,t) = \Xxt\+XM{Xt,t). Moreover, xm(X, a;) = Xm{X,x) = \Xx\ + \Xxt\- 
By the condition n G x{X,x) D x{X,t), we get XM{Xt,t) = \Xx\. Therefore 
o <2 W (WM)' W (W^t) e re5(lX|,a:,i). 

• if [[XJJ is non-linear for x and t, then o <2 (l+l|iXjJ)^ G res ( [[X JJ, x, i). 
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• if |[^JJ is non-linear for x and linear i, the proof is symmetric to the one of 
the first case. □ 

Theorem\5lM 

Given [S]u £ ShLin'^ and the binding x/t with vars(x/t) C U , we have 

u,gn^{[S]u.x/t) = [{S\S')\M y [res[X,x,t)\Jres'[X,x,t))]u , 

XCmax S' 

where S' = rel{S,x,t) and 

' {1+J if X ^ X^t and 1{X) is linear for t , 

otherwise . 



res'{X, X, t) = 



Proof 

It clearly holds that: 

mgU2{[S]u,x/t)D[iS\S')Ul y {res{X,x,t)Ures'iX,x,t))]u (B2) 

XCmax S' 

since, for each X C max 5", if res'{X,x,t) is non-empty then l+JX^ may be gener- 
ated by the characterization in Theorem 15.91 It is enough to take X' = {l{o) | o £ 
X}, hence 1+) = 1+) X"^ G res{X' , x, t) according to the last case of Theorem l5.9l 

We prove the opposite inclusion. Let X C 5' and assume that X ^ max 5'. 
There exists X' C max 5" obtained by replacing each a £ X with h G max S' 
such that a <2 b. We have that \X'\ < \X\ since two different elements in X may 
be replaced with the same maximal element in X'. We want to prove that either 
res{X,x,t) = 0, or res{X,x,t) C lres{X' ,x,t) or res{X,x,t) C Ires' {X' , x,t). 
Therefore, we assume that res{X, x,t) ^ and compare the linearity properties 
(linear, non linear, strongly non-linear) of X' w.r.t. those of X . 

If they coincide, then it follows that res{X,x,t) C lres{X' ,x,t). This happens 
because both res{X, x, t) and res{X' , x, t) are obtained by the same case of Theorem 
15.91 However, note that X' may have less elements than X and therefore some 
variable which is non- linear in res{X, x,t) could be linear in res{X' , x,t). Actually, 
this never happens since the elements in X' which are not explicitly delinearized 
are either elements of the multiset Z in the third case of Theorem l5.9l fand therefore 
may appear multiple times) or elements of (X^) subject to the condition \Xt \ < 1 

(l^xl < 1). 

Assume that the linearity properties of X and X' do not coincide. The only 
interesting case is when X is linear for x and not strongly non-linear for t. In all 
the other cases, it is immediate from the definition that res{X, x, t) C lres{X' , x, t). 

If X' is not linear for x and for i, then it holds res{X^x,t) C |res(A"', x, t) by 
definition. 

If X' is linear for x and strongly non-linear for t, then it is immediate from the 
definition that res{X,x^t) C |res(X', x, t), provided that \Xx\ > 1. Otherwise, it 
must be \Xt\ = and therefore, in order to be res{X, x,t) ^ 0, we have X = 
Xxt and XM{X,t) — 1, which means 1{X') = X is linear for t. It follows that 
res{X,x,t) = {1+1X2} ^ res'iX',x,t). 
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If X' is not linear for x and linear for we show that \Xx\ < 1. Assume, by 
contradiction, that \Xa:\ > 1. Since X' is linear for t and \Xt\ < 1, then XM{Xt, t) — 
Xm{Xt,t) < 1, while |1[^JJ| — \Xx\ > 1, which is a contradiction. Thus it must be 
\X^\ < 1. If \X^\ = then \Xt\ = 0, hence res{X,x,t) = {l+jX^} and res{X,x,t) = 
res'{X,x,t). If \Xx\ = 1, since X' is linear for t, it follows that \Z\ — 1. Thus 
res{X,x,t) C |res(X', x, t). □ 

Theorem \5.16\ 

The operator mgu^; in Definition 15.151 is correct and optimal w.r.t. mgu, when 
vars{x/t) C U. 

Proof 

It is enough to prove that mgu^,; is correct and optimal w.r.t. mguj, namely, that: 

n'^gnsi{[S,L,U]),x/t) ^ asi{ingU2{'^si{[S, L,U]),x/t)) . 
Let 'jsi{[S,L,U]) = [T]u. By TheoremEU it holds that: 
a si (mgU2 {jsi {[S,L,U]), x/ 1) 
= a,i([(T\T')Ui U iresiY,x,t)Ures'iY,x,t)})]u 

yCmaxT' 

= asi{[T\T']uU2 □ ([ires(r,x,i)]c/U2 [ires'(y,x,t)}]a)) , 

yCmaxT' 

where T' = rel(T,x,t) and U2 is the lowest upper bound in ShLin^. By additivity 
of Usi, this is equivalent to 

asi{[T\r]u)Usi y {asi{[resiY,x,t)]u)Usiasi{[res'{Y,x,t)]u)) . (B3) 

YCmaxT' 

Let X, L', U' and if as in Definition 15.151 we have that mgu^;([S', L, t/], x/i) is 
equivalent to 

[{S\X)\JK,U' [JL',U] . (B4) 

We need to prove that equations (|B3p and (jB4p do coincide. In the rest of the paper, 
we assume that the result of (IB3p is [S"', L" ,U]. 

Sharing. We first prove that the Sharing components of the two equations are 
equal, i.e. that S" = {S\X)[JK . Given B e S" , there are several cases, li B = [[oJJ 
with oeT\T', then B e S\X. 

liB = [[o]J, for o e res'{Y, x, t) with Y C maxT', then B = U{M I o e F} with 

Y = Yxt and is linear for t.li x € L then is generated by since liY) 
is linear for t.li x ^ L there are two cases: if Y is linear for t then it is generated 
by {X=^)+, otherwise by hui{X>^ U X>^^,X^ iJX^uX*). Thus B e K. 

Now, assume that B = |[oJJ with o G res{Y,x,t) and F C maxT'. Then 
B = [_}W where W = {[[oJJ | o G F}. Since Y is made of maximal elements and 
\T]u = 72([S', i, ?7]), we have that Y is linear for a; iff a; £ L. For the same reason, 

Y is linear for t iff {W,L) is linear for t. As a. consequence, if Y is non-linear for t, 
then (X, L) is non-linear for t. 

We proceed by cases: 
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Y non-linear for x and t. Then res{Y,x,t) — {\^Y^}. Since {X,L) is non-linear 
for X and t, we have X>^ U X>^ ^ and X.;, U X^t 0- Thus B e bin(X>i U 

Y non-linear for x and linear for t. By hypothesis < 1 and \ Yt \ > 1, hence 
o=(l±J>".)W(l±Ji^.'t)W(l±J>;')and 

B e bin((Xr^) + , X, U X-\ {X=^r) C . 

In particular, B E hin{{Xf'^)+,Xx,{X=f^)*) when \Yx\ = 1, otherwise B e 
bin((Xri)+,X=i,(X=i)*). 
y linear for x and strongly non-linear for t. In this case we have that o = 
i^Yx) W (W^x^t) W (W^t) with lYrl > 1 and \Yt\ < 1. By definition of strong 
non-linearity, we have two cases: 

• there exists o e Y^t such that XM{o,t) > 1: in this case 

B e bin(X, U {0}, X>\ X+, XI,) C K ; 

• there exists o EYt such that XAf(o, t) — oo: in this case 

B e hmiX=°° , X+ , X;,) C . 

Y linear for x and non strongly non-linear for t. In this case 

o = (|+Jz')w(l+Ji;^,)w(|+Jy,) , 

with \Yt\ = 1, for some Z' G p™(yr) such that \Z'\ = Xm{Yt,t) and [[Z'JJ = i;. 
It is obvious that 

B e bin({{o} U (UZ) I o e z C X„ 1 < |Z| < xiAo, t)}, C , 

by choosing Z = {lo^ \ o e Z'}. 

This proves that if i? e S", then B e {S \ X) Li K. Now, we need to prove the 
converse implication. If i? e S\X, then i? = [[oJJ for some o £ T, and it is obvious 
that o€T\T\ hence B e S"'. 

Therefore, assume that B E K, and consider the case when x E L and B G 
bin(Xr°°, X+, We have that B = A U (UA') U {UA") for some G Xf"^, A' 

non-empty subset of X^ and A" C X^t- We may find o' G maxT', Y', Y" C maxT' 
such that [[o'Jl = A, [[F'Jj = A' and 1[F"JJ = A". We have that Y'" = {o'}ur'ur" is 
linear for x and strongly non- linear for t (due to the element o'), with > 1 and 
< 1. Therefore, we may apply the definition of res to obtain res{Y"' , x,t) = 
{o} with H = B, hence B G S". 

With similar reasonings, we may prove that for every B G K, we have B G S" . 
In particular: the second line of ^ corresponds to the case we choose a Y'" which 
is linear for x and strongly non-linear for t, due to an element o G Kj'" which 
Xm(o, > 1; the third line of ([6]) corresponds to the case Y'" is linear for X and 
is not strongly non-linear for t; the first line of ([7]) corresponds to the case Y'" is 
non-linear for both x and t; the second line of ^ corresponds to the case Y'" is 
linear for t and non-linear for x. 

Finally, if x ^ L and B G it is possible that B cannot be obtained as 
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resiY'" ,x,t) for any Y'" C maxT'. However, B may be obtained as res' (Y'" ,x,t), 
choosing Y'" as in the previous cases. The same happens if a; £ Land B € 

Linearity. We want to prove that L" = L' UU' . First of all, let us define Lg = 
U \ vars{mgn2{[T]u,x/t)) the set of ground variables in iaigU2{[T](j , x / 1) , hence 
L'^ C L". We are going to prove that U' = and L'\U' = L" \ L^. The first 
equality trivially follows from the fact that the sharing component of mgu^; is 
optimal, hence a variable occurs in a sharing group of S* \ 5 U iff it occurs in a 
2-sharing group of mgU2([T][/, x/t). 

Now, we consider a variable v ^ U \ U' , and prove that w e L' iff u G i". There 
are several cases. If we assume that w ^ L, by ([8]) we have v ^ L'. Moreover, if 

Y G maxT' and v e [[^JJ, by maximality of Y we have Y{v) = oo. Hence, by 
Theorem 15. 121 we have v ^ L" . If we assume that v ^ X, hy ([8]) we have v € L' iS 

V G L. Since vars{X) = vars{T), we also have w e L" iff w e L and therefore v G L' 
iff u e L". 

The only case it remains to prove is u G vars{X) n L which, combined with the 
condition v ^ U' , gives v G vars{K) f) L. First of all, note that if u G vars{Xxt) 
then V ^ V (by definition of V) and v ^ L" (since X^t) appears delinearized in 
every 2-sharing group resulting from res or res' . If w ^ vars{Xxt), we distinguish 
four subcases: 

• X G L and {S,L) linear for t. Given Y C maxT', checking the forth case 
of Theorem 15.91 when XM(^t,i) = 1, we have that res{Y, x,t) is not linear 
for u iff w e vars{Yxt) or w G vars{Yx) H vars{Yt). Note that there exists 

Y C maxT' s.t. v G vars{Yxt) U {vars{Yx) H vars{Yt)) iff w G vars{T'^f) U 
(mrs(T^) n vars{Tl)). Finally w G i" iff w G vars{T^^) U (mrs(T^) n vars{Tl)) 
iSvG (X,tU(X,nXt)) iff « G L'. 

• a; G L and (S*, L) not linear for t. Given Y C maxT', checking the third and 
forth cases (when XM{Y,t) > 1) of Theorem 15.91 we have that res{Y,x,t) 
non- linear for v implies v G vars{Yxt) or u G vars{Yx), which is equivalent to 

V G X^t U Xrc, i.e. V ^ L'. On the other hand, if w G X^, we distinguish the 
cases: 

— {S, L) strongly non- linear for t. There exists o E T' such that xm(o, t) = 

00 or o G T^t such that xm(o, t) > 1. Moreover, there exists o' G 
such that u G [[o'JJ. If we take F = {o, o'}, we have that res{Y,x,t) is 
not linear for v, hence v ^ L" . 

— {S, L) is not strongly non-linear for t. There exists o G such that 

1 < Xm(o, < cxo. Moreover, there exists o' G such that v G [[o'JJ. If 
we take Y' = {o, o'}, by the fourth case in the definition of res, we have 
res{Y, x,t) is not linear for v, i.e. v ^ i". 

• X ^ L and {S,L) linear for t. If w ^ i" then v G ?;ars(KE4) or v G t;ars(y4). 
This implies v G X^t^Xt, i.e. v ^ i'. On the other hand, if w G , there exist 
o GT'j. such that xm(o, = oo and o' G T/ such that v G [[o'Jj. By definition 
of res, we have that res({o, o'}, a;, y) is not linear for v, hence v ^ L" . 

• X ^ L and (5, L) non-linear for t. Since L' = X \ X, it is obvious that v ^ X'. 
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Moreover, there exist o E T' such that xm(o, — oo, o' G T' such that 
XM{o,t) > 1 and o" G T' such that v G l[o"Jj. Note that it is possible that 
o = o' = o" . By definition, we have res{{o, o' , o"}, x,t) is not Hncar for v, 
hence v ^ L" . □ 

Theorem \5.19\ 

The operator nigu^; in Definition 15. 181 is the optimal abstraction of mgu. 
Proof 

First of all, given a finite set of variables V, let us define the extension operator 
exty : ShLin^ ShLin^ such that exty([5];7) = [S* U {w | w e \ U}]uuv- Given 
V — vars{x/t) \ U, we have that 

asi{mgU2{jsi{[S,L,U],x/t)) = asi{extv{ragU2{'ysi{[S, L,U])),x/t)) . 

By Theorem 15. 161 we have that 

mgn,i{[S,L,U],x/t)=mgnJ[SUV,LUV,UUV],x/t) = 

a.iimgn^ijsiiiS UV,LUV,UU F], x/t))) . 

Hence, it is enough to prove that 

extvhsii[S,L,U])=j2i[SUV,LUVM^V]) . 

By definition of 72 , we have that 

j,i{[SUV,LUV,UUV]) = 
= [{Bluv \ BeS}U {Bluv I B G V}]uuv 
= [{Bl \ B<eS}U V]uuv [since vluv = v] 
= extv{lsi{[S,L,U]) , 

which completes the proof. □ 
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